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METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS 
AND METHODS OF SCREENING FOR MODULATORS OF BREAST 

CANCER 

5 CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to USSN 60/263,965, filed January 24, 2001; 
USSN 60/265,928, filed February 2, 2001; USSN 09/829,472 filed April 9, 2001; USSN 
60/282,698, filed April 9, 2001; USSN 60/288,590, filed May 4, 2001; and USSN 

10 60/294,443, filed May 29, 2001, all of which are incorporated herein by reference m their 
entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein 
15 expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
breast cancer; and to the use of such expression profiles and compositions in the chagnosrs, 
prognosis and therapy of breast cancer. The invention further relates to methods for 
identifying and using agents and/or targets that inhibit breast cancer. 



BACKGROUND OF THE INVENTION 
Breast cancer is one of the most frequently diagnosed cancers and the second 
leading cause of female cancer death in North America and northern Europe, with lung 
cancer being the leading cause. Lifetime incidence of the disease in the United States rs one- 
in-eight with a l-in-29 lifetime risk of dying from breast cancer. Early detection of breast 
25 cancer, using mammography, clinical breast examination, and self breast examination, has 
dramatically improved the treatment of the disease, although sensitivity is still major concern, 
as mammographic sensitivity has been estimated at only 60%-90%. Treatment of breast 
cancer consists largely of surgical lumpectomy or mastectomy, radiation therapy, anh- 
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hormone therapy, and/or chemotherapy. Although many breast cancer patients are effectively 
treated, the current therapies can all induce serious side effects which diminish quahty of life. 
Deciding on a particular course of treatment is typically based on a variety of prognostic 
parameters and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978; 
5 Hamilton and Piccart, 2000, Am,. Oncol. 1 1 :647-663), including genetic predispostion 
markers BRCA-1 and BRCA-2 (Robson, 2000, J. Clin. Oncol. 18:1 13sup-1 18sup). 

Imaging of breast cancer for diagnosis has been problematic and limited. In 
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an 
important prognostic factor; five year survival rates drop from 80 percent in patients with no 
1 0 lymph node metastases to 45 to 50 percent in those patients who do have lymph node 

metastases. A recent report showed that micrometastases can be detected from lymph nodes 
using reverse transcriptase-PCR methods based on the presence ofmRNA for 
carcinoembryonic antigen, which has previously been shown to be present in the vast 
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med. 

15 339(4):223 (1998). 

The identification of novel therapeutic targets and diagnostic markers is 
essential for improving the current treatment of breast cancer patients. Recent advances in 
molecular medicine have increased the interest in tumor-specific cell surface antigens that 
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens 
20 suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and 
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell 
antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herce P tin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer 
25 (Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal 
antibodies (Rituxm®/rituximab) are used to effectively treat non-Hodgekin's lymphoma 
(Maloney et al, 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol. 
10:548-551). 

Other potential immunotherapeutic targets have been identified for breast 
30 cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane 
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al., 2000, Clin- Cancer Res. 6:1693-1701, Scholl et , ^ ^ 

tumor maraer, CA 15-3 (Bon * a,., 1997, On. Chen. 

•aw H"PP9 nor MUC-1; therefore, it is clear mat oxner uug 
have tumors that express neither HER2 nor MU^ 

need tohea d ent~^^^ 

.ported to he overexposed in breast cancer, such as EGFR (Samsb^ 
10 1(8547):1398-1402), c-erbB3 (Naidu et al., 1988, Br. J. Cancer 78 .1385 1390) FG 

(Lut-Llorcaet^ 

r ,0-2255 2264) MTA1 (Nawa et al., 2000, J. Cell Biochem. 79.202-212), 
Anticancer Res. 20.2255-2ZMJ.mi v 59.41.48) 

- a 1 rsr,,rtptal 2000 Breast Cancer Res. Treat. W .41 
hreast cancer associated gene 1 (Kurt et ai., zuw, ^ 

15 ge „eshavebe e nreponad(for re view,seaG re en e lal.,2000,Canoer 

noneareowrantlyapptoved-forbreastcancertherapymlhaUS. 

DWoa^ofcerUingenes^BSTsd.soriWasb^expr^n^. 

„ ana WO-00/55173, WO-9S/4532S and WO-00/22130. Snarly, r- 

SS. ThenUof-ganes.desc^lnaaonoflhesepubUca.iona.and^ 
disalosureaara incorporated herein in thair entirety. 

While indoatry and academia have identified novel sequences, there haa 
beenanec.naUttor.ex.ned.ordcn^.hctac.ionof.heaenovelseqnen^The 

llendc targets and diagnostic mato is essentia, for improving the corren Ireattn n, of 
breast cancer patients. Accordingly, provided herein are molecular targets for therapeutic 
^loni broastandothe™ Addi^providedhe^arom^matcan 
mtervenuuii , nTOvide( i are methods that can 

be used in diagnosis and prognosis of breast cancer. Further provided 
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SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that 
are u P - and down-regulated in breast cancer cells. Such genes are useful for diagnosbc 
5 purposes, and also as targets for screening for therapeutic compounds that modulate breast 
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a breast 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
10 biological sample from the patient with a polynucleotide that selectively hybndizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of a breast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
1 5 breast cancer-associated transcript in a cell from a patient, the method comprising contacting 
a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 
20 in one embodiment, the biological sample is a tissue sample. In another 

embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 
25 in one embodiment, the patient is undergoing a therapeutic regimen to treat 

breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA. 
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. ent theme thod further comprises the step of amplifying 
In one embodiment, the memo the polynucleotide. 

^rjrrr^rrzr 

I!, J*-. Inapretareneorbodnn — 

coding of a polynuo.co.ida senaence es sbown inTables m 

I„ one embodiment, an expression vector or call oompnses the tso 
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fa one aspect, the present tendon provides an isolated ,*P** ^ » 

U corner aspect, dre present invention provides an antibody drat spectficaUy 
b ^ l „anisoU,e4po.yp^dc»M*is^by.nuo 1 e i oadd» Ml coulohav m g 

polynucleotide sequence as shown in Tables 1-25. 
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In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 
5 In one aspect, the present invention provides a method of detecting a breast 

cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
1 0 biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the 
1 5 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect. 
20 In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 

cell membrane. In another embodiment, the polypeptide is recombinant. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
25 proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
compound identified as described herein. 

In one embodiment, the compound is an antibody. 
In another aspect, the present invention provides a drug screening assay 
30 comprising the steps of: (i) administering a test compound to a mammal having breast cancer 
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or ,„ a cell sample isolated tart** W sparing the level of gene express™ of a 

aa shown in Tables 1-25 in a treated cell or— lwith.be lev., of gene expressron o the 
p„ ly nocleotide in a control 0.11 sample or mammal, wherein a tea. oomponnd that modulates 
are ,evel of expression of mo po,ynuo.eo.ido is a oandidate for the treatment of breast cnncer. 

In one embodiment, the oontrol is a mammal with breast oanoor or a oell 
^pte therefrom that has not been treated with the test compound. In another emboduoent, 

the control is a normal cell or mammal. 

In one embodiment, the test compound is administered in varying amounts or 
concentrations. In another embodiment, the test compound is administered for varying time 
periods. In another embodiment, the comparison can occur after addition or removal of me 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are 
15 individually compared to their respective levels in a control cell sample or mammal. In a 
preferred embodiment the plurality of polynucleotides is from three to ten. 

In another aspect, the present invention provides a method for treating a 
mammal having breast cancer comprising administering a compound identified by the assay 

described herein. 

In another aspect, the present invention provides a pharmaceutical 
composition for treating a mammal having breast cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

In one aspect, the present invention provides a method of screening drug 
candidates by providing a cell expressing a gene that is up- and down-regulated as in a breast 
cancer. In one embodiment, a gene is selected from Tables 1-25. The method further 
includes adding a drug candidate to the cell and deternuning the effect of the drug candidate 
on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
30 comparing the level of expression in the absence of the drug candidate to the level of 

7 
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cldidat. can «, when present, m d wherein ft. cumparison can occnr after 
^ova.ofur^g^.u.aprefe^^^nrec.U.xpressesa,^^ 

cxpressionproft.e genes. The profrle genes may show an increase or 

Also provided is a method of evaluating the effect of a candrdale breast cancer 
tog comprising adnrinislering «he dcug ,o a transgenic animal expressing or over-„g 
.Kebreastcancer modulatory protein, or an animal lacking the breas, cancer modular 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nuclerc actd 
segments of Tables .-25, wherein ,he biochip comprises fewer than .000 nucleic acid probes. 
Preferably, a. leas. <wo nucleic acid segments are induded. More preferably, at leas, three 

nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with breast cancer 
is provided. The method comprises determining the expression of a gene of Tables 1-25, 
preferably a gene of Table 25, in a first tissue type of a first individual, and comparing the 
distribution to the expression of the gene from a second normal tissue type from the first 
individual or a second unaffected individual. A difference in the expression indicates that the 
first individual has a disorder associated with breast cancer. 

In a further embodiment, the biochip also includes a polynucleotide sequence 
of a gene that is not up- and down-regulated in breast cancer. 

In one embodiment a method for screening for a bioactive agent capable of 
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory 
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory 
protein or fragment thereof. In a preferred embodiment, the method comprises combmmg a 
breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an 
antibody which binds to said breast cancer modulatory protein or fragment thereof. The 
method further includes determining the binding of said breast cancer modulatory protem or 
fragment thereof and said antibody. Wherein there is a change in binding, an agent is 
identified as an interfering agent. The interfering agent can be an agonist or an antagomst. 
) Preferably, the agent inhibits breast cancer. 
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Also provided herein are methods of eliciting an immune response in an 
individual. In one embodiment a method provided herein comprises administering to an 
individual a composition comprising a breast cancer modulating protein, or a fragment 
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those 
5 of Tables 1-25. 

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more 
preferably of Table 25, or a fragment thereof, and a pharmaceutical^ acceptable earner. In 
10 another embodiment, said composition comprises a nucleic acid comprising a sequence 
encoding a breast cancer modulating protein, preferably selected from the nucleic acids of 
Tables 1-25, and a pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a breast cancer protein, 
or a fragment thereof, comprising contacting an agent specific for said protein with said 
15 protein in an amount sufficient to effect neutralization. In another embodiment, the protein is 
encoded by a nucleic acid selected from those of Tables 1-25. 

In another aspect of the invention, a method of treating an individual for breast 
cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the 
20 method comprises administering to a patient having breast cancer an antibody to a breast 

c'ancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 
be a cytotoxic agent or a radioisotope. 
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DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including 
metastatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 

Tables 1-24B provide unigene cluster identification numbers for the 
30 nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer 
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« ^ m 11 10 n 18 19 20 21, and 22 list those genes that are 
easUancerceUs. Table ,,2, 3,^,^23 , ist ta « are down.cgu.a.ed ■» breast 

that is part of the unigene cluster. 
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The term "breast cancer protein" or "breast cancer polynucleotide" or "breast 
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic vanan^, 
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has 
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 8 0 o/o, 85%, 90 A. 
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleoude 
sequence identity, preferably over a region ofover a region of at least about 25,50, 100 ,200, 
500 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 
Tables 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an mununogen 
comprising an amino acid sequence encoded by a nucleotide sequence of or associated wath a 
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybndize 
under stringent hybridization conditions to a nucleic acid sequence, or the complement 
thereof of Tables 1-25 and conservatively modified variants thereof or (4) have an ammo add 
sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%, 
80% 85%, 90%, preferably 91* 92%, 93%, 94%, 95% 96%, 97«/o, 98% or 99% or greater 
amino sequence identity, preferably over a region ofover a region of at least about 25, 50, 
100 200 500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide 
sequence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide 
sequence is typically from a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer 
polypeptide" and a "breast cancer polynucleotide," include both naturally occurring or 
recombinant forms. 
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A "full length" breast cancer protein or nucleic acid refers to a breast cancer 

normally contained in one or more naturally occurring, wild type breast cancer 
polynucleotideorpol^ 
5 stages of post-translation processing or splicing, including alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a breast cancer protein, polynucleotide or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
10 tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also 
include explants and primary and/or transformed cell cultures derived from patient tissues. A 
biological sample is typically obtained from a eukaryotic organism, most preferably a 
mammal such as a primate e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea 
15 pig, rat, mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells {e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
20 methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 

be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 

25 same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g.. NCBI 

30 web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 
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* ^d* I*—." This deftnMon aUo rt. <», or may be 
co mP ,hnen<„fa,esUe,uenoe. t ^T g 

and/or additions, as well as those that have substitutions, as well ^ ^yiepreferred ' 

po^rpbioor — * As descnbed below, the pref™* 
5 , g lL oan aocoun, fc, gaps and the 1*0. Preferably, identity exists over a regton tha , 

lL^25^*--^^««' — 0 - , " ,, " ,,,,, " 
50-100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which testsequencesarecompared. men using a sequence comparison algonthm, test 

10 andreferenc.sequencesare^^ 

if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters canoe used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 
, 5 A "comparison window", as used herein, includes reference to a segment of 

one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 m wtuch 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
20 for comparison are well-known in the art. Optimal alignment of sequences for companson 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Ad, A PP l. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, /. Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat I. 
Acad.Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 
25 (GAP BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspections eg., Current Protocols in Molecular Biology (Ausubel**/., eds. 

1995 supplement)). 

Preferred examples of algorithms that are suitable for determining percent 
30 sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 

12 
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wMcharedescribedinA^^^ 

al J Mol. Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters 
therein, • — — ^"^^^ 

; NationalCenterforBiote^^ 

algorithm involves first identifying high scoring sequence pairs (HSPs) by identfymg short 
words oflengthWinthequery sequence, which eimer match or satisfy someposrtxve-valued 
threshold score T when aligned with a word of the same length in a database sequence. T ,s 
referred to as the neighborhood word score threshold (Altschul et «/., supra). These antral 
0 neighbomoodwordhitsact^ 

them The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated usmg, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching resumes; 
always > 0) and N (penalty score for mismatching residues; always < 0). For ammo add 
5 sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
20 sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLAST? program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matnx 
{se e Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1989)) alignments (B) of 
25 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences ( S **, e.g.. Karlin & Altschul, Proc. Natl Acad. Sci. USA 90:5873- 
5787 (1993)) One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 
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b ela rg en eg ativenu m be r s ) e. g .,5,10,20 > 30,40 ) 40,70,90,110 ) 150,17,et. 

AnindicationthattwonucleLcacidsequeacesorpolypept.desare 
identical is that the polypeptide encoded by the first nucleic acid is inununologicaUy cross 

acid) as describedbelow.Thus,apolypeptide is typically substantially iden.cal to a second 

polypeptide, e.g., where ^i^pa^dlfcodylV--^---^ ^ 
10 Lltionthattwonucleicacidsequencesares^bstan^ 

or their complements hybridize to each other under stringent conditions, as descnbed below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
1 5 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g.. the American Type Culture 
Collection catalog or web site, www.atcc.org). 
20 The terms "isolated," "purified;' or "biologically pure" refer to material that is 

substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present m a 
25 preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protem 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 
30 most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
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removing at least one contaminant from the composition to be purified. In tins sense, 
purification does not require that thepurified compound be homogenous, e.g., 100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residue, The terms apply to amino acid polymers 
5 in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic ammo acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
15 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 

modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring amino acid. 
20 Amino acids may be referred to herein by either their commonly known three 

letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 

accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 
25 acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 
30 most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino 
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l^ecl to another of the corresponding codons described without altenngthe encoded 

polypeptide. M^-d^-W—*^ 1 "^'' 
collvely .odifiedvariation, Everynucleic acid sequence herein winch encodes a 
^epddeaLdescrihessUentvarianonsofthenucleicaci. Oneofs^w Irecogn.e 
LLertain contexts each codon in a nucleic acid (except AUG, which is ordinarily the 
only codonfor methionine, and TGG, which is ordinarily the only codon for tryptophan) can 
be modified to yield a fictionally identical molecule. Accordingly, often silent vanaUons of 
a nucleic acid which encodes a polypeptide is implicit in a described sequence with respect to 
the expression product, but not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of ammo 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar ammo acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention.typically 
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), 
Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Argmine (R), Lysine (K); 5) 
Isoleucine (1), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosme (Y), 
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine <M) 

e g., Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e g. Alberts et «/., Molecular Biology of the Cell (3« ed., 1994) and Cantor & Schimmel, 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that 
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often form a compact unit of the polypeptide and are typically 25 to approximately 500 
amino acids long. Typical domains are made up of sections of lesser organization such as 
stretches of p-sheet and a-helices. 'Tertiary structure" refers to the complete three 
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three 
5 dimensional structure formed, usually by the noncovalent association of independent tertiary 
units. Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical 
equivalents used herein means at least two nucleotides covalently linked together. 
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more 
10 nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and 

polynucleotides are a polymers of any length, including longer lengths, e.g., 200, 300, 500, 
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have alternate backbones, comprising, e.g., phosphoramidate, 
1 5 phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 
positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 
20 Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & 

Cook, eds.. Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on abiochip. Mixtures of naturally occurring 
25 nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, including, for 
example, phosphoramidate (Beaucage et aL, Tetrahedron 49(10): 1925 (1993) and references 
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et aL, Eur. J. Biochem. 81:579 
30 (1977); Letsinger et aL, Nucl. Acids Res. 14:3487 (1986); Sawai et al, Chem. Lett. 805 

17 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



0 984>, linger e. aL, , Am. Chem. Soc. , ,0:44,0 (.« 

Ju* Pa- No. 5,644,048). phosphorodilhioale 0* - *• * ^ ^ ^ 

ltaklg es (see EghoUn, , Am. Chem. So, 114: 18 95 (.992); Meter elaL, "™ 
31-1008 (1992); Nielsen, Nature, 365:566 (1993); Cartoon el al„ Nature 380.207 (1996), 
of which am incorporated hy reference). Other analog nucleic adds include .hose w.th 

backbones (U.S. PatenlNos. 5,386,023, 5,637,684, 5,602,240, 5216,141 and 4, 69,863, 
Kredrowshi a. al, Angew. Cham. ML Ed. English 30:423 (1991); ~« ^ 
Chem Soc. 1 10:4470 (1988); Latsingcr el al., Nucleoside & Nucleotide 13:1597 (1994); 
Chapters 2 and 3, ASC Symposium Series 580, "Camohydrate Modif.ca.ions in Amtsense 
Research", Ed. Y.S. Sanghui andP. DanCook; Mesmaeker e. al., Bioorganic Medtctnal 
Chem Lot 4:395 (1994); Jeffs el .1., J. Biomolecnlar NMR 34:17 (1994); Tetrahedron Lett 
37-743 (1996)) and non-riboso backbones, including those described in US. Patent Noa. 
5 235 033 and 5,034,506, and Chapters 6 and 7. ASC Symposium Series 580, "Carbohydrate 
Modifications in Antiaense Research", Ed. Y.S. Sanghu, and P. Dan Cook. Nucleic acrda 
containing one or more carboeyenc sugars are also included within one definition of nucleto 
acids (see Jenkins etal, Chem. Soe. Rev. (1995) „p 169-176). Several nucleic acid analogs 
« described in Rawls, C & E News June 2, 1997 page 35. All of these references are hereby 

expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, an 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics PNAs have larger changes in the melting temperature CU for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Sinularly, 
30 due to their non-ionic nature, hybridization of the bases attached to these backbones is 
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„ —e .o saU — io. In addition, PNAa are no, degraded oy oeiiniar 
enzvmes and thus can be more stable. 

eon^portionsofbothdoublestrandedorsinglcstrand^ sequence. As wiU be appreaated 
by those in the art, the depiction of a single strand also defines the sequence of the 
complementary strand; thus the sequences <^^*^"^*T* 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleoUdes, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, mosme, 
xanthine hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a 
naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA As used herein, the term 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g. the individual units of a peptide nuclerc 
acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and protems 
or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be 
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position. 
Any method known in the art for conjugating the antibody to the label may be employed, 
including those methods described by Hunter etaL.Bta 144:945 (1962); David etal., 
Biochemistry, 13:1014 (1974); Pain et al., JJ^mZM^ 40:219 (1981); and Nygren, I 
Wistonhem. apH f-vtochem., 30_:407 (1982). 

An "effector" or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
) The "effector" can be a variety of molecules including, e.g., detection moieties including 
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Ugs, a toxm; ac.iva.able moiedes, achemotherap-ic ** ' " anUbI ° te ' 

r adioiso,op=en,itting'We.g-.'>* rad M' > "- 

A "labeled nucleic acidptobeor oligonucleotide ,s one that is o , 

Waals electrosta He, orbydrogenbond, to a.abe 1S uchul«.«iepre^ceof tberobomay 

using bigb affinity interactions may acbievo the same results where one of a par of binding 

partner S bindstouieother,o.g.,biotin, S treptayidiu. 

Asu S edbcreina»nuc,oicaoidprobeo,oligonnclcoSde",sdefm.da3a 

uucUicacidcapableofbnidnag.oab.getnuo.eicacidofco^plenienlaoia^neo.tao^ 

though hydrogen bond fonnation. As used herein, a probe may include nature. (...., A. ft C 

0, T) or modified base, (Meazagnanosine. inosine, etc.). in addition, me bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not 
functionally interfere with hybridization Thus, e.g., probes may be peptide uucletc actds ,n 
which the consdmen, bases adjoined by peptide bonds rather man phosphodiester linkages. 

1, will be undersmod by one of skill in the ad that probes may bind target sciences lacking 
complete complementarity with the probe sapience depending upon the stringency of the 

, hybridization condition, The probes are preferably directly labeled as with isotopes 

ohromopho«s,.umiphores,chromogens, or indirect., labeled snob as with biotin to wbtoh a 
streptavidin complex may later bind By assaying for the presence nr absence of me probe, 
one can detect the presence o, absence of the select sequence or subsequence. Diagnosis or 
prognosis may be baaed a. the genomic love!, or a. the .eve! ofRNAor protein expression. 
5 The term "recombinant" when used with reference, e.g., to a cell, or nucleic 

acid, protein, or vector, indicates tat the cell, nucleic acid, protein or vector, has been 
m0 dif,ed by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic aoid or ptotein, or tat the cell is derived from a cell so modified. Thus, e*. 
reoombin- cells express genes tat are no. found within the native (non-reootnbinant) form 
30 ofta cell or express native genes tat are otherwise abnormally expressed, under expressed 
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, • «;^» herein is meant nucleic acid, 
«ed at all By the term "recombinant nucleic acid herem is m 
or not expressed at all. By manipu i ati on of nucleic acid, e.g., using 

made using recombinant techniques, i.e., through the expression 

acid as depicted above. . 

Theterm-heterologous-whenusedwithreferencetoportionsofanucl^c 

acid indicates that die nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinant* produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
codingregionfromanothersource. Similarly, a heterologous protein will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcriptionofanucleic acid. As u,ed herem, a promoter mclude, nece.ary nucleic 
acid sequences near the start site of transcription, such as, in the case of a polymerase H type 
; pr0 moter,aTATAelement. A promoter also optionally includes distal enhancer or repressor 
elements.whichcanbelocatedasmuchasseveralmousandbasepairsfrommestartsiteo^ 

transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 
,0 to a functional linkage between a nucleic acid expression control sequence (such as a 
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^o^ayof^np^factorbiading^an^secondnuoWcacid^uenoe, 
promoter, or array entrant y nucleic acid 

. t_ * 11 Tii*»PYnression vector can be pan oidpioon , 

,0 dup^oahy^ofa^o^.apa^c^nuc^ddc^u^ 

sJLgen, hybridan conditions when that stance is present . . complex aaxtare (a.g., 

total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to cond,t,ons under 

which a probe will hybridiae to its target subsequence, .ypicauy in a complex m*m« 
,5 nucleic acids, bu, ,0 no olher sequences. Stringent conditions are sequenee^penden. and 
will be different in different circumstances. Longer sequences hybridize specially at 
higher tomperahnes. An exfcnsive guide to the hybridization of nocleio acids is found m 

M es "Overview of principles of hybridization and the strategy of nucleic ac,d assays 
20 (1W 3).*Gene ra ny,s,rin g en.oo„dirion,areseleo.ed,ob=abou t 5.10"Clowetmau,he 

thermal meuing point (T J for the speciftc sequence a. a defined ionic strength pH Th T 
me temperature (under denned ionic stiengm, pH, and nucleic concentration) a, whtch 50 A 
of me probes cnmplcmenhtry «> the targe, hybridize ,„ the targe, sequence a. equihbnum (as 
the target sequences are present in excess, at T„ 50% of the ptobes are ocenpted a. 
25 equilibrium). Stringent conditions will be mose in which the sal, concentration . ; less than 
about 1.0 M sodium ion, typical* about 0.0! to 1.0 M sodium ion oonoehtration (or other 
salts) a. pH 7.0 to 8.3 and fire temperature is a, leas, about 30°C for shot, probes (e.g., 10 to 
50 nucleotides) and a, leas. abon. 60'C for long probes (e. g , gtea.er man 50 nucleotides), 
Stringent oonditions may also be achieved wi.h .he addition of desUbiltzmg agent* such as 
30 fotmamide. For selective or specific hybridization, . positive signal is a, leas, two times 
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hybridizationconditionsoanbeasfollowmg. ^ ^ 

• * nr Sx SSC 1% SDS, incubating at 65 C, wun w 
incubating at 42 C, or, 5x SSL, stringency 

. p nr pcr a temperature of about 36 c is lypiv 
0.1% SDS at 65 C. For PLK, a wwy c 

— — ™^"--"~— - 

t „ abo u,65-C,dep e ndu, g on ttl epn m e rl .og t ha n d3pec lfl c«y. 

. T • , „/ n Q90"i PCR Protocols, A Guide to Methods and 
reactions are provided, e.g., in Inms et al. (1990) PCK rroi 

Anvlications, Academic Press, Inc. N.Y.). 

bvbfldizadoncnndiflous-Mudeah^^^ 
, flaUrC.a„d»wash i n.XSSCa«45-C. A P.sidve hybridan is aUeasrtwrce 
Zand. ^oro^s.U^^^— byb~and 
w^oo.dUionsoanbea.iU.ed.oprovideaondMonaofsani.ars^ency. Add,uons> 
pWtafcd— areprovidedunnu.neroasrafeence.a.g., 

and Current Protocols in Molecular Biology, ad. Ausubel, «<.<■ 

Tbe pbrase "functiona. effects" in the context of assays for .esung compounds 

nra, is indirectly or directly under dre influence of.be breast cancor protein or undent actd, 
intiudes ligand binding activity; cell grow* on soft agar; anchorage dependence; contact 
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- — -^-rsc: rs* 

growth factor or serum dependence; tumor specific marker levels, in 
^ • DMA «n<\ nmtein expression n cells undergoing 

tumor growth and metastasis in vivo; mRNA and protein express 
tumor growtu mm «p lim .tional effects" include in 

metastasis, and other characteristics of breast cancer cells. Functional 

vitro in vivo, and ex vivo activities. 

By^etermimngme^^ 

functional effects can be measured by any means known to those skilled in the art, e.g., 
changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
rneasuringinduciblemarkersortranscriptionalactivationof^ 

measuring binding activity or binding assays, e.g. binding to antibodies or other hgands, and 
measuring celluiar proliferation. Determination of the mnct^ 

art such as an in vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact 

inhibition and densi^^ , 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matngel; 
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing 
metastasis, and other characteristics of breast cancer cells. The functional effects can be 
evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative 
or qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for breast cancer-associated sequences, measurement of RNA stability, 
identification of downstream or reporter gene expression (CAT, luciferase, P -gal, GFP and 
the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody binding, 
inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of breast cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 
or compounds identified using in vitro and In vivo assays of breast cancer polynucleotide and 
30 polypeptide sequence, Inhibitors are compounds that, e.g., bind to, partially or totally block 
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^— prolein. "Activators" - 
may *em «o «. expression - — ' t ata , sensito, ago^, « 

„^ rt — • **~£ t ^wimaWacnvity.as 

chemtal molecules and the bke V wlym g putative 

^pressing me breast cancer protetn <* or rf 

«£ , or more brens. oaucer protons, e.g., 1. 2, 3, 0 . ^ ^ ( 

cancer proteins, such as breast cancer patterns encoded by me seouen 

S amplesora S! .y S compnsmgbrea, t cancerpr„,em S m..are,rea,edr«ma 
(mrtreared wiminbibUors) are assigned a rCativeproleinacUv^vaiueo 

more preferably 150%, more preferably 200-500 /.(r.e., two 

oon^B.n.orapraferan.ylOOO-olW.mgner. myctalgetaodleroW mand 
,, The phrase "changes m cell growth refers to any cn 6 

Wation of growth, loss of grow* facto, or actum reeuiremenrs, changes m cell 

^gor.oanrgmmrm^ gatai ng or loaing mmor speothc marhera, 
30 :Z t oLoraupp K aammomwhenm j ec,enin,osui,ab,eanima,hoa K ,and,or 
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X— cells," Wormed" ceUtt or Wotmatiou" to too culture refers 
^^d.— I* A*-*«i— HoncanaHse^tofocrtonw.^— , 

Transformation is associated withphenotyptc changes, such as immortrOrzahonofoells, 
aberrant growth con.ro,, nonmorphologica, changes, and/or malignancy (sea, Freshney, 
Culture of Animal Cells a Manual of Basic Technique (3« ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework region from an 
tamnnog.obuhn gene or fragments thereof that speciflcally bind* and recogtozes an antigen. 
The recognized immunoglobulin genea include the kappa, lambda, alpha, gamma, delta, 
«ta. and mu constan. region genes, as weU as the myriad immunoglobuUn variable regton 
genes Light chatos are classified as either kappa or lambda. Heavy chains are clashed as 
gamma, mu, alpha, deUa, or epailon, which to torn define the immunogtobulin classes, IgG, 
IgM IgA, IgD and IgE, respectively. TypicaUy, the antigen-binding region of an anttbody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 

Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pan 

of each chain defines a variable region of about 100 to 1 10 or more amino acids pnmanly 
.esponsibleforantigenrecognition. The terms variable Ught chain (V L ) and variable heavy 
chain (V H ) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsm 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)' 2 . a 
, dimerofFab which itselfis a light chainjoined to V H -C„1 by a disulfide bond. TheF(ab)' 2 
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,v . « break the disulfide linkage in the hinge legion, 
may be ^eeed under mild co^ttonstobre* me monom6ris 

,993). While various antibody fragments .redefined to 

herein, a)so includes antibody fragments either produced b sing „ 
cbain Fv) or thoae identified using phage display libranos (ses. rag, McCafferty 
10 348:55W54 °:Z«parafiono f andbod i o S ,o,.,— «, nronoolona, or po.yolono, 

256:495-497 (1975); Kozbor or al, *** ™ ^ °* 

15 Zl— (.9,1)1 Harlow * Lane, i —7 — - 
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the 
production of ring* chain antibodies (US. Paten. 4,946,778) can he adapted to produce 

.otouran^als.nraybeusM.oexprosshunranMan.thodios. Alternately, phage 

specifically bind to se.ec.ed antigens (see, eg., McCafferty e, of., Nature 348.552-554 

)1990) Marks el at.. Biotechnology 10:779-783 (1992)). 

' A»chimerictmtibod/Ms a n a nriboaymolecnlein«hich(a)fhecuns B nt 

.gion or a portion thereof, is altered, replaced or exchanged ao that Che antigen binding rule 

andbody, eg., an enzyme, toxin, hormone, grow* factor, drug, etc, or (b) the vanable 
region, or a portion thereof, ia altered, replaced or exchanged with a variable region havnrg a 
different or altered antigen specificity. 
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m on. asp.* the expression levels of genes are dotermrneo ,n Afferent 
• „fii,of aD articularsampleisessentiallya"fingerpnnt of the state 
. nlbe, of genes Stanley aUows the generation of a gene express^ pro*, .ha, 

Z oan Lnaanec^n^oo^o^^^ueonnbc^w*^ 
Lea of ann other tissues fe. surviving canoer pa«s. By conrpanng oxprossron 

obtained, . u 

The identification of sequences that are differentially expressed m breast 
cancer versus non-breast cancer tissue allows the use of this information in a number of ways. 
For example, a particular treatment regime may be evaluated: does a chemotherapeutxc drug 
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular 
patient. Similarly, diagnosis and treatment outcomes may be done or confirmed by 
comparing patient samples with the known expression profile, Metastatic tissue can also be 
analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene 
expression profiles (or mdividual genes) allow screenmg of dmg candidates ^ 
m imicking or altering a particular expression profile; e.g., screening can be done for drugs 
thatsuppressmebreas^^ 

uprising setsoftheimportantbreast cancer genes, which can then be used in these screens, 
these methods can also be done on the protein basis; that is, protein expression levels of the 
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents. 
!n addition, the breast cancer nucleic acid sequences can be administered for gene therapy 
purposes, including the administration of antisense nucleic acids, or the breast cancer protems 
(including antibodies and other modulators thereof) adrninistered as therapeuUc drug, 
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Thus thepresent invention provides nucleic acid andprotein sequences that 

ahigher level) inbreast cancer, as well as those that are down-regulated (i.e., expressed at a 
lower level). In a preferred embodiment, the breast cancer sequences are from humans; 
however, as will be appreciated by those in the art, breast cancer sequences from other 

cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 
horses, etc.) and pets, e.g., (dogs, cats, etc.). Breast cancer sequences from other orgamsms 
may be obtained using the techniques outlined below. 

Breast cancer sequences can include both nucleic acid and amino acid 
sequence, As will be appreciated by those in the art and is more fully outlined below, breast 
cancer nucleic acid sequences are useful in a variety of applications, including diagnostic 
applications, which will detect naturally occurring nucleic acids, as well as screemng 
applications; e.g., biochips comprising nucleic acid probes or PCR microliter plates with 
selected probes to the breast cancer sequences can be generated. 

A breast cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determined as outlined below, using either homology programs or hybridization 
conditions. 

For identifying breast cancer-associated sequences, the breast cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal and cancerous 
tissues, or tumor tissue samples from patients who have metastatic disease vs. non metastatic 
tissue Other suitable tissue comparisons include comparing breast cancer samples with 
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers, 
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, drug resistant 
states, and tissue undergoing metastasis, are applied to biochips comprising nucleic acid 
30 probes. The samples are first microdissected, if applicable, and treated as is known in theart 
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,■ nofmRNA Suitable biochips are cordially available, e.g. from 

-yKd ' ^nae^enUbagaaaasbow^cbangeainaxp^onaabe^n ^ 
, mddi5 l su.es a.ecompared.ogenasexpreaseaino.hernonaa^sues.p^y 
nonnal and disease siaw* r kidney, 

normalbreast,but also including, andnot limited to ' 

signifi cantamountmom^^ 

3 lldiments.thisisnotnecessa.. 

that the target be disease specific, to minimize possible side effects. 

inapreferredembodiment.breastcancerseqnencesarethosethatareup- 

regul atedinbreast cancer; that is, the expression of these genes is higher in the breast cancer 
Jsueascomparedtonon-canceroustissue. ^-regulation" as used herein often means at 

five-fold or higher being preferred. All unigene cluster identification numbers and accession 
numbers herein are for the GenBank sequence database and the sequences of the accession 
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see, 
e g. Benson, DA, et al, Nucleic Acids Research 26:1-7 (1998) and 
20 http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e^g 

EuropeanMolecular Biology Laboratory (EMBL) and DNADatabase of Japan (DDBJ). 
US Patent ApplicationN.09^ , 
further discloses related sequences, compositions, and methods of diagnosis and treatment of 
breast cancer is hereby expressly incorporated by reference. 
25 In another preferred embodiment, breast cancer sequences are those that are 

down-regulated in the breast cancer; that is, the expression of these genes is lower inbreast 
cancer tissue as compared to non-cancerous tissue (see. e.g., Tables 1,2, 3, 15, 16 etc > 
"Down-regulation" as used herein often means at least about a two-fold change, preferably at 
least about a three fold change, with at least about five-fold or higher being preferred. 
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Informatics der sed ^ breast cancer 

canadditionaUyprovidelugh-resolunon.high-s^ 

5 ^ofd^os^^^^ 
biosensordevelop^^^ 

exarap ,e, subcellular toxicologicaHofo— ^J"^^,^ 
^acHvity—n^Anderson,,/^ ^ CA(June lM2j 

, i „, , h , tbc Proleom cs conference, Coronano, y 
,998)). SubcCular^cologica.infom.aUoncanalsobeu.^u.a * 

W*. », leas, one se, of assay dab, The da* con,ained in Ore dattrbase i. aapnred, e g 

my form m winch data can be maintained and transmitted, bul is preferably m el ^ tI °" C 
20 Hase Tbc e ,ectt. ra cda*asc„ £ft einvcn,ioncanbem,n tt incdo„anye,ccttomc 
nraUowm^ortttcs^eof.dacoessttrmeda^.ancbns.porson.compn.cr, 

.Upref.mblydisttlbntodonawide^nc^sucbasmeWorldW.dcW*. 

Lfocnsofttrcprcacntsectiononda.abasesmatincmdepcpttdescuenco 
^isforolarttyofil— 0 nly. ttwdlbe app«en,,ottrose„f « in^ear, ^ snm,ar 
25 Uascscanbcasaemblcdforany.aanyd^aco.nircdnamganassayofmemven.ro, 

Tbccompoaido^andmcmodaforidcnttfyingrforr.nantt.a.mgdrerda.rv. 

md/OT absolote abundanco of a van* of mo.ecn.ac and maeromo.eeo.ar species ttom a 
30 £ pl ^.^,-^^^^ , ^ ,,, ~ 
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Although the data generated from the assays oiiuc 

database system for storing biomolecular sequence infonnatton m a mannet th rows 
seauences to be catalogued and searched according to one or more protem functron 

TbecJoguedandsearchedaccordrngtoassociaUonwi^ 

forobtaining full-length sequences from the collection of partial length sequences U.S. 
Patent 5,706,498 disc^^^^ 

using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
incomputerdatabasesby comparison of predicted mass spectra wimexperimentally^denved 
rnassspectrausingacloseness-of-fitmeasure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysts 
described as on-line analytical processing (OLA?), which entails the consolidate of 
projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5 295 261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 

See also Mount et ah, Bioinformatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin et ah, eds., 1999); 
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevams & 
Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological 
Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 
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Tayta, eds., 2000); Brown, ' ' «* - *— « * 

5 Waterman, » «** Maps, **-~ - " ^ 

The present invention provide a compter database oomprtstng a eomputet 
and software fot storing in conrpnter-retrievah,. form assay da* records ovulated, e.g., 
with data specifying the source of the targe.-containing sanrple tan which each secoence 
specificity record was obtained. 

10 man exemplary embodiment, at least one of the sources of target-contammg 

sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a know* pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for breast cancer. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 

15 target species in a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 

present in the sample. 

The invention also provides for the storage and retrieval of a collection of 

20 target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 

25 transistor gate states, such as an array of cells in aDRAM device (e.g., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 
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WHen ft. to8 « U a ^ - — ■* - • — r**^ 
or retrieved from a computer storage device or database and 

beoftherelativeamountofapeptideornucleicacidsequencemapoolofsequences 
determinedfromapolypeptideornucleicacidsampleofaspecunen. 

The invention also preferably provides a magnetic disk, such as an IBM- 

(. g Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy drskette or 
nard'dixed, Winchester) disk drive, comprising a bU pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing 
devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, 
.hereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
• degree of identity and gap weight to the target data. A central processor is preferably 
initialized to load and execute the computer program for alignment and/or comparison of the 
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from the data file, which comprises a binary description of an assay result. 

The tar get data or record and the computer program can be transferred 
secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
betweenaselected assay i*^^^**^^-**'** 

acentralproce^sorcanbeac 

PA-8000 SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysts 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc )• an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic stnp reader, or 

other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtainedby the methods of the invenUon, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of companson results 
on the basis of computed similarity values. 

Characteristics of breast cancer-associated proteins 

Breast cancer proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intraceUular proteins. In one embodiment, the breast 
cancer protein is an intracellular protein. Intracellular proteins may be found mthe 
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaling pathways); aberrant expression of snch 
, proteins often results in unregulated or disregulated cellular processes (see. e.g., Molecular 
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15 



20 



„ , a n. * ,A ed 1994). For example, many intracellular proteins 

P^s „ va.es -b — — ns, -d « >»*- . — 8 *e 
structural integrity of organelles. orese nce 

An increasingly appreciated concept in character^ protems . the presen 
intheproteinsofoneormore motifs for which defined functions have been attnbuted. n 

conserved sequences have been identified in proteins that are involved m protem-pr t m 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphory ated 
targets in a sequence dependent manner. PTE domains, which are distinct from 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to prohne-nch 
targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreaated by 
one of ordinary skill in the art, these motifs can be identified on the basis of prunary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may assoaate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protem domams. 
Versions are available via the internet from Washington University in. St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden {see. e.g., Bateman et al. Nuc. 
Acids Res 28:263-266 (2000); Sonnhammer et al, Proteins 28:405-420 (1997); Bateman et 
al, Nuc. Acids Res. 27:260-262 (1999); and Sonnhammer et al, Nuc. Acids Res. 26:320-322- 



25 

(1998)). 



(1998)). 

In another embodiment, the breast cancer sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
They may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those already desenbed 
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and/ormay serve asabinding site for ^^l^* 1 ^^ 
Jnases havebothproteinkinase activity and SH2 doniams. b ^«Uo^W 

"^^smernbraneproteinsn^ 

domain, For example, receptor tyrosine kinases, certain cytokine receptors, receptor 

guanylyl cyclases and ^ a 
tnmsmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
tnmsmembrane domains include approximately 20 consecutive hydrophobic amino adds that 
may be followed by charged amino acids. Therefore, upon analysis of the ammo add 
sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/). 
Important transmembrane protein receptors include, but are not limited to the insulin 
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 
transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein 
receptor, epidermal growth factor receptor, leptin receptor, interleukin receptors, e.g. IL-1 

receptor, DL-2 receptor, 

The extracellular domains of transmembrane proteins are diverse; however, 

conserved motifs are found repeatedly among various extracellular domains. Conserved 

structure and/or functions have been ascribed to different extracellular motifs. Many 

extracellular domains are involved in binding to other molecules. In one aspect, extracellular 

domains are found on receptors. Factors that bind the receptor domain include circulating 

ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 

For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 

) bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
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anchor, or may ft-*- ■» traosmcmon.no prolans. Extinc* 
tvoicallv oermeebltad to provide access to intracellular proteins. 

W n „„ 1 al S obeapprecU«edby fc o S einmea ttl ha,a t ransmomoranopm,omo»n 

Furthennore, mmamembrane proteins ma. have boon made aoluble oan be made to be 
15 secmtedlhrough recombinant means by adding an appropriate signal sequence. 

In another embodiment, Ihe breast cancer proteina are secreted protems; the 
secretion of which oan be either constitutive or regulated. These proteina have a argn all 
peptide or signal sequence that large* lhe molecule to Ihe secremry patirway. Secreted 
proteina are involved in numeroua physiological events; by virtue oftheir circulating natiue, 

anaumoriuomamrerCaoting on IheceUti- secreted tire faoter), a paracrine manner (aotirrg 
„„ cells in Cose proximity to Ihe cell tira, secreted tire M or an endocrine manner (acting 
„» eel* a, a distimoe). Thus aecreled moleculoa find use in modulating or altering numerous 
aapecis of physiology. Breast cancer proteins thai am aecreled proteins am particularly 
25 preferred in .he preaen. invention as .hey aorvo as good targets for diagnostic markera, ,g„ 
for blood, plasma, serum, or stool tests. 

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identified by 

30 sub stantialnu^^ 



WO 02/059377 



PCT7US02/02242 



10 



15 



s^esou.linedh^.Suchbomologycanb.based^atoovc.aU-e.oao.dor 

programs or hybridization conditions. Typicafiy, finked sequent on a mRNA aro fotntd 

theSmem0l l: b roaa,ca„cerouo 1 a^d S e (1 oeoco S ofdreiov.otio,e,,drase q ueooo S 
ta Tab.esl-25,oanbarrag m .o K of.argergmos,U,«hayarenucWoacid S o 8 ma„ti i . 
■ w io dris context includaa codiog regions, nomcoding regions, and nnxtums of codmg 
and non-coding regions. Aeeordingiy, as will be appreciated by tirose in ft. ad, ustng the 
sequences provided herein, extended sciences, in either dimction, of.be breast oancer genes 
can be obtained, naing techniques wefi known in the art for cloning either longer sequence, or 
the fhU length sequences; see Ausnbel, a, al, supra. Much can be done by infonnatios and 
many sequences can be clustered to include multiple sequences corresponding to a stngle 
gene, e.g., systems such as UniGene (see, hup://www.nchi.nhn.nih.gov/Un,GeneO. 

Once the breast cancer nucleic acid is identified, it can be cloned and, .f 
nec^sary, its constituent parts recombined to form the entire breast cancer nucleic acid 
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g„ 
contained within a plasmid or other vector or excised therefrom as a linear nucleic an,d 
segment, tire recombinant bmast cancer nucleic acid can be further-used as a probe to .dentify 
m i isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be 
used as a "precursor" nucleic acid to make modified or variant breast cancer nucleic actds 
and proteins. 

The breast cancer nucleic acids of the present invention are used in several 
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
I or for adniinistration, e.g., for gene therapy, vaccine, and/or antisense applications. 

Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer 
proteins can be put into expression vectors for the expression of breast cancer proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids 
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
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mad . The nuCeic acid probe. ahached » the biochip am de^ed * * ^— V 
complement -o me breas, cane. nucleic acids, U. the urge, sequence M-» ~ 

Lg,e stranded nucteic acids of * present invention. However, if .he number of mutations 
is so great ma, no hybridization can occur nude, even .he ieas. stringent of hybridan 
cnndiUons. *, sequence is no. a comp.enaen.ary targe, sequence. Thus, by "snbstanttally 
complement herein is mean, that the probea are sufficiently complementary to the target 
sequences .0 hybridize under normal reaction conditions, particularly high stnngency 

conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
either overlapping probes or probes to different sections of the target being used. That is, 
two three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher sensmvrty. 

As will be appreciated by those in the art, nucleic acids can be attached or 
unrnobilizedtoasolidsupportinawidevarietyofways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
0 covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 

40 



PCT/US02/02242 

WO 02/059377 



c included in non-covalent binding is the covalent 

fomtedbyac^slinlrer or by ta**.*.*-*.— « group 

covalent and non-covalent interactions. „,„r„™as 
10 u, genera., the probes are anached .0 «he biochip in a wade vanely of ways, as 

^appreciatedbythoseinlheari. Asdeseribedberei„, ft enue..ic acidseanerUrerbe 

ftebi0Cl>iP ' rnebiochipcnmprises.suitablesoUdsubsu^ By "substrate" or "solid 

contain discrete individual sites appropriate for to attachment or association of the nucletc 
,cid probea and is amenable «o a, leas, one detention medaod. As wiU be appreciated by those 
ta the art, the number of possible substea.es are very large, and toclude, but are no, luruted «o, 
glass and modified or tbnntionaliaed glass, plastics (including acrylics, polystyrene and 
20 copolymercofrttymneandomermateriaKpo^ 

poWurethones, Teflon!, ete.), polyaaccbarides, nylon or nitrooellulose, testa, siltoa or athoa- 
based materials inoluding silicon and modified silieon, oatbon, metals, inorganic glasses, 
plastics, ete. In general, me snbsteatcs allow optical detection and do not apprectably 

25 HuorescrmtPUstic Biochip, U.S. Applicant Serial No. 09/270,214, fiUdMarch 15, 1999, 
herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to — 
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closed cell foams made of particular plastics. 

fc a prefer embeOitnent, «« — ^ bi °*» fc ^ 7 " 
with ebemica. banana, grt,ups fot — — — - * 

to gnaups, oarboxy group, 0*0 groups a*d thiol groups, with amino groups ^ng 
partieuLy prefetted. Using mese mnotionat groups, the probes can be attached us,n g 
letups on m, probes. For nucieic -* — 
cached to surfaees comprising amino groups, e.g. using .inkers as are known m ft. art, e.g., 
tMMUU ^era as are we.l known f .994 Pierce Chemtcal Company 
oal ,log,.echnieal S ee»ononeross.Unker S ,pa g «sl55-200). In addition, in some cases, 
additional linkers, such aa rdky. groups (ineiuding substituted and heteroaiky. groups) may be 

used. . u , 

In this embodiment, oligonucleotides are synthesized as is known m the art, 

and then attached to the surface of the solid support. As will be appreciated by those stalled 

in the art, either the 5' or 3" terminus may be attached to the solid support, or attachment may 

be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 
bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
know, in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those descnbed 
in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
leV el of breast cancer-associated sequences. These assays are typically performed in 
j conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid 
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LLls provides a measure of the amount of breast cancer-associated RNA. Methods of 
I! 2e amotion are we,, hnown to .hose of shif, in me nr.. D-W pretooe, for 
quantitative PCR are provided, e.g., in Iiuiis el al., PCR Protocols, A Guide to Methods and 

dye andarquenching agent, the probe hybridizes m a PCR pmductibu. canoo. itself be 
ex.endeddue.oab.ochingagen.a.tbeS'end. When the PCR product is amplified in 
subsequent cycles, Ore 5' nue.ease activity of tire polymerase, rag., AmpliTaq, resnhs re the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3 
quenching agent, thereby resulting in an increase in fluorescence as a function of 
amplification (see, e.g„ nferahrre provided by Peridn-Ehner, erg., wv^.peridn-elm.r.com). 

Other suitable amplification methods include, but are not limited to, hgase 
chain reaction (LCR) (see Wu & Walface, Genomics 4:560 (1989), Undegren etal. Science 
241 1077 (1988), and Barringer et al, Oe„e 89:1 17 (1990)), transcription amplification 
(Kerch e< «! Proc. Natl.Acad. Set USA 86:1173 (1989)), self-sustained sequence replication 
(Guat.111 et al.. Proe. No,. Acai. Set. USA 87:1874 (1990)), do. PCR, and linker adapter PCR, 



etc. 



Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e.g., encoding breast 
25 cancer proteins are used to make a variety of expression vectors to express breast cancer 

proteins whicb can tbenbe used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in the art {see, e.g.. 
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and are 
used to express proteins. Hie expression vectors may be either self-replicatmg 
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 
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refers to DNA sequences used for the expression „ t „rvotes e c include a 

• r„ ntm l seouences that are suitable for prokaryotes, e.g., mere 
particular host organism. Control sequences Euka ™tic cells are 

5 p rom o.er,opuo M .lyanopera to rse,uence,andarthosomeb m dur E s.te. Eukary 

known to utilize promoters, polyadenylahon signals, and enhancers. 

NLleicacidia-ope-y.inlted-wheni.isp.aeedintoahrnchona, 

unship with another nucleic acid sequence. For example, DNA for a presequene e or 
secretory, ader is openly Wedto DNAfor apo.ypepddeif i«,a expreased K ap«pr^ 

Uoper^lyUn.red^.e^gse^^ifi.UpoaiUrmedao.^U.a.e^.ahom 

and, in me case of a aecretory ,eader, conuguous and in reading phase. However, enhancers 
M do no. have ,o be contiguous. Linking re typically accompnshed by .igahon a, convenren, 
restriction sites. If such sites do not exist synthetic oligonucleotide adaptors or .inkers are 
used in accordance with convenuona. practice. Transcriptional and transteuonal regulatory 
nuoleie aeid will general., be appropriate to the host cell uaed to express the breast cancer 
protein. Numerous types of appropriate expression vectors, and suitable regulatory 
20 sequences are known in the art for a variety of host eells. 

In general, transcriptional and tranalational regulatory sequences may melude, 
bn, are no. limited to, promoter sequences, ribosoma! binding sites, narrseriptional surt and 
stop sequences, u-analanona. ster, and step sequences, and enhancer or activator sequences 
fc, a preferred embodiment, the regulatory sequences include a promoter and transcrrptronal 

25 start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known m the art, 
and are useful in the present invention. 
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^ ed in 1 organisms, e.g. in mammalian or insect celis for express.cn and ma 
5 Loltheexpressionvectorcontainsatleastones^ 

appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). 
10 In addition, in a preferred embodiment, the expression vector contains a 

selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The breast cancer proteins of the present invention are produced by cultunng a 
host cell transformed with an expression vector containing nucleic acid encoding a breast 
15 cancer protein, under the appropriate conditions to induce or cause expression of the breast 
cancer protein. Conditions appropriate for breast cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
20 proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
25 and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coU, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the breast cancer proteins are expressed in 
ma mmatian cells. Mammalian expression systems are also known in the art, and include 
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10 



^hasis generfy "'" ? "°" torus, a, « 
nmmntprs from mammalian viral genes, since me vn<u B 

include those derived form SV40. 

The methods of introdueing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate predion, 
polybrene mediated transfection, protoplast fusion, electroporation, vual infection, 
encapsulation ofthe polynucleotide(s) in liposomes, and direct microinjection ofthe DNA 

into nuclei. , . 

In a preferred embodiment, breast cancer proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 
bacteriophage mayalso be used and are known in the art. In addition, synthetic promoters 
and hybrid promoters are also useful; e;g., the tac promoter is a hybrid of the trp and lac 
proffi0 ter sequences. Formed ^ , 
pro motersofnon-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient nbosome 
5 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the breast cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria)or into the periplasm, space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 
bacterial strains that have been transformed. Suitable selection genes include genes which 
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• ■ w to drues such as ampicillin, chloramphenicol, erythromycin, 
components are assembled into expression vectors. Expression vec 

express ion vectors » — ed »- « - -» " S 

te art, such as calcium chloride treatment, elecuoporauon, and others. 

In one embodiment, breast cancer proteins are produced u> msec, cells. 

expression vectors, are well known in the art. 

In a preferred embodiment, breast cancer protein is produced m yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 
15 KIuyveromycesJragilisw&K. lactis, Pichia guillerimondii and P. pastorts, 
Schizosaccharomycespombe, and Yarrowia lipolytica. 

The breast cancer protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the 
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an 
20 immunogen. Alternatively, the breast cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the breast cancer protem is a 
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 

acid for expression purposes. 

In a preferred embodiment, the breast cancer protein is purified or isolated 
25 after expression. Breast cancer proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present m the 
sample Standard purification methods include electrophoretic, molecular, immunological 
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the breast cancer protein 
30 may be purified using a standard Ultrafiltration 
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protein. In some instances no pttrification will be necessary. 
« Oneeexpre S sMarK 1 pnnae4ifn=eessary,unebreas,ca„cerpro tel nsand 

^.*».^.^**^^^* — 
reagents, as vaccine reagents, as screening agents, etc. 

Variants of breast cancer proteins 

10 m one embodiment, the breast cancer proteins axe derivative or vanant fan* 

the derivative breast cancer peptide will often contain at least one amino acrd subsUtutron, 
deletionor insertion, with amino acid .h^^vbWV^^'™ 
acidsub^ 

peptide. ^ ^ ^ effibodiment of breast cancer proteins of the present 

invention are amino acid sequence variant, These variants typically fall into one or more of 
three classes: substitutional, insertional or deletional variants. These variants ordinanly are 
preparedbysite specific mutagenesis of nucleotides in the DNA encoding the breast cancer 
20 protein, using cassette or PGR mutagenesis or other techniques well known in the art, to 
produceDNA encoding the variant, and thereafter expressing the DNA in recombmant cell 
culture as outlined above. However, variant breast cancer protein fragments having up to 
about 100-150 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 
25 a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
breast cancer protein amino acid sequence. The variants typically exhibit the same 
nuaUtative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an.amino acid sequence variation is 
30 predetermined, the mutation per se need not be predetermined. For example, in order to 
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' ""J" ? Screwing of the mutants is done using assay, of teas, 

mutagenesis aud PCR mutagenesis. Screening 

some cases deletions may be much larger. 

Substimtio^.detedons.insemonsoranycombmaaoumesoofmaybousedto 

«*. a. a final derivative. General* tee ohang* are done on a few amino acltfc * 

desire substitutions are generally nurd, in accordance with .he amino acid subs.ttut.on 

relationships provided in the deenition section 

The variant typically exhibit the same qualitative biological activity and wtll 
cW , «re same immune response as the nabirallyoccnmng anulog, although variants also are 
selected to modify die characteristics of the breast cancer proteins as needed. AU.mat.vely, 
the varian. may be designed such «ha. .he biological activity of ft. breas, cancer protein ts 
altered. For example, glycosylation sites may be altered or removed. 

Substantial changes in function or immunological identity are made by 
seieoring substitutions that are iess conserve than those described above, for example 
substitutions may be made which more significantly affect: the stnsctureof.hepo.ypept.de 

fcecharg.orhyd^hobici.yoffitemo.e^ a, the target or theb.dk of.he.de oham. 
The substitutions which in genera! are expected to produc the grea.es, changes tn the 
po-ypepfide* properties a re ««sem w«ch(a) ahvdropMfioresiduo.c.g. serylor fi^! K 
substituted for (or by) . hydrophobio residue, e.g. leuoyl, W-* pnenylalauyl, v*. or 
. alanyl; (b) a cysteine or proline is substituted for (orby) any otherresidne; (o) aresulue 
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■a u • ivsvl arginyl, or histidyUs substituted for (or by), 
having an electropositives.de toeg ; ^'^'^ a residue having a bulky side 
an.lectronegativeresidue.e.g.g^ 

chain, e. g . phenylalanine, is substituted for (or by) one no ^ M ^ 
Covalent modifications of breast cancer polypeptides are inci 
k- • fn LtypeofcovalentmodificaUon^ 

My bribed below. Common* used crosslink^ agents include, e.g., 11- 
JL^M*» ghttaraldehyde, N-hydroxy—tde esters , e.g„ «M 
J!— yUc acid, nomobimnctional unid« M «— - *» 
s»cb as 3,3-^ob^ucetaWdylp^ion^), biftocdona. maleimides such ■ ta* 

mafcimido-U-octaneandagents^^ 

Other modifications include deamidation of ghttaminyl and asparaguryl 

proIta and lysine, phosphoryUtion of hydroxy, groups of setyl, dsteonyl 

Prom,: Secure I**—. » < 1983 »' aCM >" i " i '" 1 

terminal amine, snd amidation of any C-tenninal carboxyl group. 

Another type of covalent modification of the breast cancer polypeptide 

ofthepolypeptide. "Altering the native glycosylate, pattern" is intended for purposes 
, bereintomeandeletingoneormoreca^hydmtemoiedeafoundmnativeser.uencebre^ 

cancer polypeptide, and/or adding one o, mote glycosyUtion sites dta, are no, present . ft. 
native sequence breast cancer polypepdde. Glycosylate, patterns can be altered tn many 
ways. For example the use of different cell types to express breast cancer-assomted 
sequences can result in different glycosyladon patterns. 
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byte ^i tt on.f,o, S ub S .Wo.by,on e o,mo,e S e m eorto 
' ee breast cane po.ypepMe (for OMred glycol «. . The 

z»-^^« ta ^r*trC^S--'-- 

polypeptide. Such methods are described in the art, e.g., 

Lefav^e.yof^^ycosi^Md^nbeubyTho^e.a,.,^ 

Enzyme!., 138:350 (1987). 

Mote type of covatart modification of breast eaoeer oompnses Imtag the 
breasteaneerpolyoepttdetooaeofavarietyofnonpro.einaeeeospolymers.e.g, 

U.S. Patents. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,92 or ^ 

Breast eaneer polypeptides of Ore presea. inventton may a.so be tnodtfied „, 

n.osouapolypeptideora^iaoaeldser.aenee. b oae embodied,, sued a efcatene 
JpHlateonofabreaa.e.eerpo.ypepUdewldraUgpo^pepb *** 

3„ ^saebepi.pe-.gged^ofa^ca^po^decanbeda,^^ 
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an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast 
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an 

5 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
fflS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et 

10 al, Mol Cell BioL 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et al, Molecular and Cellular Biology 5:3610-3616 (1985)); 
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al, 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
{Hopp et al t BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al, 

15 Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al, J, Biol. Chem. 

266:15163-15 166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al, 
Proa Natl Acad. ScL USA 87:6393-6397 (1990)). 

Also included are other breast cancer proteins of the breast cancer family, and 
breast cancer proteins from other organisms, which are cloned and expressed as outlined 

20 below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 
used to find other related breast cancer proteins from humans or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in 
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 

25 about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for 
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to 
30 generate antibodies, e.g. , for immunotherapy or immunodiagnosis, the breast cancer protein 
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an antibody or T-ceU receptor in the context of MHC. thus, in 

parhcu-arty linear epitope, In . »*nrf embodiment, » ep.«op« - «- * 

encoded by a nucleic acid of the figures or fragment thereof or a toon pro,.tn .hereof. I. 

, taM .o keyhole Umpe, hemocyanin, aororo albumin, bovine thyroglobulin, and soybean 
WkUto Examples of adjuvants which may be ployed include Freund s complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 

without undue experimentation. 
0 The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

antibodies m ay be prepared using hybridoma methods, such as those described by Kohler & 
MM Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to ehat 
Oocytes thatproduc^orarecapableofp^ 
l5 theinm.unizingagent.Altemanvely.mely^phocytesma^ 

^unizing agent will typically include a polypeptide encoded by annexe aad of Tables 1^ 
2 5or fr agmentmereof,orafusionprotemmereof.Generally,eimerperi^ 
lymP hocyte S ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
cells are used if non-human mammalian sources are desired. The lymphocytes are then 
30 fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
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„p 59-103(1986)). taoflalizedoeU lines are usuaUytra^fonnedma^nalmc* 

■heunfus^taunortaUzedcnlls. For example, iftoparenta! cell, lack to enzyme 

for to hybridomas .ypieaUy will include hypoxanlhine, arninopterin, and Ihymrdme ( HAT 
medium"), whieh anoyances prevent to growth of HCPRT-deficient cells. 

„ in one embodiment, the antibodies are bispecific antibodies. Btspecrfic 

^•bodies are monoclona!, preferably human or humanized, antibodies ma. have bindmg 
specificities for a. least two different antigens or tot have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of to binding specificities is for a 
protein encoded by a nucleic acid Tables 1-25 or a fragment thereof, to otor one is for any 

5 otor antigen, and preferably for a oell-aurfaee protein or receptor or receptor subuntt, 
preferably one that is tumor specific. Alternatively, tetremer-type technology may create 

multivalent reagents. 

In a preferred embodiment, the antibodies to breast cancer protein are capable 
of reducing or eUminating a biological function of a breast cancer protein, as is described 
20 below That is, the addition of anti-breast cancer protein antibodies (either polyclonal or 
preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce 
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or 
the like is preferred, with at least about 50% being particularly preferred and about a 95- 
1 00% decrease being especially preferred. 
25 In a preferred embodiment the antibodies to the breast cancer proteins are 

humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labs Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab Fab' F(ab')2 or other antigen-binding subsequences of antibodies) which contain 
30 minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
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substantially all of the CDRregrons correspond to those of ano 

immunogiouum v sQfiH992Y) Humamzation 

332-323-329 (1988); and Presta, Curr. Op. Struct Ac* 2.593-596 (19»» 

jjz.^ j** v /» winter and co-workers (Jones et al , 

canbeessentiaUyperfonnedfollowingthemethodof * 

can oe ebbcuua / f oon.oo-i -197 H988Y Verhoeyen et al, 

Man* 321:522-525 (1986); Riuchrnann«a,.,M><»re 332.323-327 (1988), y 

Sd^ 239:1534-1536(1988)), by .ub^rodentCDE, or CDRs^uenc. for*. 

dJL antibodies (U.S. P - NO. 4,8.6,567), wnereur substantia,,, less lhan an -* 
buma n variable domain has b^nsubsS^ed by ftocorr.spondmg^ueoce from a oo„- 



20 

human species 



25 



Human antibodies can also bo produced naing various .oohniuuus known un fbe 

^aisoavailabloformeprapa-bouofhuansnmouoolona^dboriieslColeara, 

W *- " " (l985) ^ t Tt 

,„« (.991)). Similarly, unman antibodies oan be made by intiodnomg of humaar 

^noglobutinlooi into tiansgenic antauds, e.g., mine inwuiohthe endogenous 
^(obmmgeueshav.beonpartiaUyoroomp.e.alyinaotiva.ed. Upon cWlengo, 
30 hu man m tibodyprodnotioni S obsarvad,wb i obo 1 o S e, y r K =mb l es l na.s=anmbuman SI „aU 
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i (1994); ttatarill « «*« 14 - 84W1 (1W6> ; 8 (H95) . 

B, immunotherapy is meant treatment of breast cancer w.«h an antrbody 

against breast cancer proteins. As used herein, immunotherapy can be passive or active. 
^Luno^yaad^haremUm.p-vetransraro^dy.oa^ 

wJanlganU.wraohantibodiosaroraiseo.Mappreciatadbyonoofordmarysk.Umma 
art, me antigen may be providod by injecting a polypeptide against whioh antibodies are 
desired to bo raise* into a recipient or contacting the recrpien, with a nucleic ac.d capable 
,5 expressing the antigen and under conditions for expression of tire antigen, leadntg to an 

immune response. . 

In a preferred embodiment the breast cancer proteins against winch antibod.es 

are raised are secreted proteins as described above. Without being bound by theory, 
antibodies used for treatment, bind and prevent the secreted protein from binding to tta 
20 receptor, thereby inactivating the secreted breast cancer protein. 

In another preferred embodiment, the breast cancer protein to winch 

uaedfbrn^entbtndtec^Uularoom^oftobre^can^pretem^^ve-^ 

25 antibody may cause down-regulation of the transmembrane breaa. cancer pro.cn. As he 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 

toeas.cancerpro.ein The antibody is also an antagonist the breaa, cancer protem. 
Furt her, the antibody prevents activation of the transmembrane toons, cancer proteut. In one 
30 aspect, when me antibody prevents the binding of other molecules to the breast cancer 
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md IL-2, or chemomerapootic — «— 5FU ' "''"T " ^ 

activates serum complement when coniplexed with the transmemor F 

cancer pK-Wn. M«M - » — * »** * 
otherwise provide meam to locally ablate cells. 

In another preferred embodiment, the antibody is corrugated ,0 an effector 
moie^.Tbeeffeemrmoielycanbeanynnnrneeofmoleome^oludinglabeUingmoienes 

su ch as radioactive Ube.s or fmoreacentiabela, or canbe atherapeotic moitfy. mono aspect 
^mempeotiemoie.yisasm.llmo.conle.ba.modnUtts me activUy of.be breast cancer 
protein. In m o«beras P eo.memen,peoac m oi=rymodnla.es.beae t ivityofmoleou.es 
; ^iareowiarormoloseprordmitytomcbrcasloanoerprolein. The therapeotic mo.eby 
may inhibil enzymatic aetivUy snch as protease or oollagenase or protein kinase actrvty 

associated with breast cancer. 

In a prefened embodiment, the therapentio moiety can also be a cytotoxrc 
ag en.. In tins method, targeting me cytotoxic agent to breas, cancer tissoe or cells, resrdte rn 
0 a redaction in the nmnbor of afflicted ceUs, thereby reducmg symptoms aasoorated wrth 
breas. canoe. Cytotoxic agente are nameroos and varied and inolude,bn, are no.titni.ed to, 
cytotoxic drugs or toxins or active fragments of snob toxins. Snitab.c toxins and then 
corresponding fragments tactade diphtheria A chain, exotoxin A chain, ricin A Cham, abnn A 
chain, curcin, crotin, phenomycin, enomycin and me tike. Cytotoxic agente also mctade 

(he antibody. Targeting toe merapeotic moiety to tmnsmembrane breas. cancer proton* no. 
od, serves to increase the local concentration of ti.erap.ntio moiety in the bra*, cancer 
afficW area, bn, also serves to reduce deleterions side effects mat may be associated wrth 
30 the therapeutic moiety. 
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In another preferred embodiment, the breast cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
5 the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a 
nuclear localization signal. 

The breast cancer antibodies of the invention specifically bind to breast cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
10 K4 of at least about 0.1 mM, more usually at least about 1 ^M, preferably at least about 0.1 
\xM or better, and most preferably, 0.01 yM or better. Selectivity of binding is also 
important. 

Detection of breast cancer sequence for diagnostic and therapeutic applications 

15 In one aspect, the RNA expression levels of genes are determined for different 

cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue 
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 

20 essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 

25 obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 

30 patterns within and among cells and tissue. Thus, a differentially expressed gene can 
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qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
evaluated in a breast cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed 
simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
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a particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of 
5 particular interest are methods wherein an mRNA encoding a breast cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 

10 examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 

1 5 washing to remove the non-specifically bound probe, the label is detected. For example a 
digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

20 In a preferred embodiment, various proteins from the three classes of proteins 

as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 

25 expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
30 Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis 
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of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 
5 detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the breast cancer protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 

10 to many antibodies to the breast cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the breast cancer protein(s) 
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 

1 5 preferred embodiment each one of multiple primary antibodies contains a distinct and 

detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 

20 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing breast 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 

25 Antibodies can be used to detect a breast cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 

In a preferred embodiment, in situ hybridization of labeled breast cancer 

30 nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 

61 



WO 02/059377 PCT/US02/02242 



breast cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further understood that the genes which indicate the diagnosis may differ from 
5 those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing breast cancer sequences are used in prognosis 

10 assays. As above, gene expression profiles can be generated that correlate to breast cancer, 
in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
with the use of genes being preferred. As above, breast cancer probes may be attached to 
biochips for the detection and quantification of breast cancer sequences in a tissue or patient. 
The assays proceed as outlined above for diagnosis. PCR method may provide more 

15 sensitive and accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drug screening assays. The breast cancer proteins, 
20 antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are 
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
25 agent (e.g., Zlokarnik, et al. y Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 
30 function of a breast cancer protein. As above, this can be done on an individual gene level or 
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by evaluating the effect of drug candidates on a "gene expression profile" In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 
5 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level That is, having identified a particular gene as up regulated in breast cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene 

10 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast 

1 5 cancer tissue compared to normal tissue often provides a target value of a 1 0-fold increase in 
expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard 

20 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.. 
25 In this embodiment, the breast cancer nucleic acid probes are attached to 

biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, 
may be used with dispensed primers in desired wells. A PCR reaction can then be performed 
and analyzed for each well. 
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Expression monitoring can be performed to identify compounds that modify 
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide 
sequence set out inTable 17. Generally, in a preferred embodiment, a test modulator is added 
to the cells prior to analysis. Moreover, screens are also provided to identify agents that 
5 modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or 
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 

1 0 indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue 
fingerprint. In another embodiment, a modulator induced a breast cancer phenotype. 

1 5 Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

Drug candidates encompass numerous chemical classes, though typically they 
20 are organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
25 preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate agents are 
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred 
30 are peptides. 
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In one aspect, a modulator will neutralize the effect of a breast cancer protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell 

In certain embodiments, combinatorial libraries of potential modulators will be 
5 screened for an ability to bind to a breast cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating- the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 

10 are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 

15 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 

20 chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop etal, J. Med. Chem. 37(9):1233-1251 (1994)). 

25 Preparation and screening of combinatorial chemical libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka, Pept ProL Res. 37:487-493 
(1991), Houghton et aL 9 Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

30 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
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hydantoins, benzodiazepines and dipeptides (Hobbs et aL, Proc. Nat. Acad. Sci. USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et aL, J. Amer. Chem. Soc. 
114:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann etal.,J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses 
5 of small compound libraries (Chen et aL, J. Amer. Chem. Soc. 1 16:2661 (1994)), 

oligocarbamates (Cho, et al 9 Science 261 : 1303 (1993)), and/or peptidyl phosphonates 
(Campbell et aL, J. Org. Chem: 59:658 (1994)). See, generally, Gordon et aL, J. Med. Chem. 
37:1385 (1994), nucleic acid libraries {see, e.g., Strategene, Corp.)* peptide nucleic acid 
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g„ Vaughn et aL, Nature 

10 Biotechnology 14(3):309-3 14 (1996), and PCT/US96/10287), carbohydrate libraries {see, 
e.g., Liang et aL, Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. , 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 

15 U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,5 14; and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

20 . A number of well known robotic systems have also been developed for 

solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 

25 synthetic operations performed by a chemist. Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
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MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
{see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
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preferred. Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
5 to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 

10 may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 

1 5 preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 

20 nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. 

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
25 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 
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After the candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
5 with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemilumine§cent, a chemical, or a radioactive signal, to provide a means of 
10 detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
1 5 epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
20 probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

5.594.117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 

5.594.118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, voider 

25 conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target Stringency can be controlled by 
30 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
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temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
5 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
10 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
15 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the breast cancer phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
20 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
25 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress a breast 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
breast cancer gene expression profile so as to mimic the expression of the gene from normal 
30 tissue, a screen as described above can be performed to identify genes that are specifically 

70 



WO 02/059377 PCT/US02/02242 



modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or 
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences 
can be identified and used by methods described herein for breast cancer genes or proteins. 
5 In particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated breast cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
breast cancer cells, that have an associated breast cancer expression profile. By 

10 "administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 

15 the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
20 generated, as outlined herein. 

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g., 
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably 
many, of the expression profile indicates that the agent has an effect on breast cancer activity. 
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter 
25 the phenotype can be devised. With this approach, the drug target need not be known and 
need not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
30 differentially expressed gene as important in a particular state, screening of modulators of 
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either the expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or 
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a 
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
5 acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a 
preferred embodiment, the breast cancer amino acid sequence which is used to determine 
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as 

1 0 further described herein. 

Preferably, the breast cancer modulatory protein is a fragment of , 
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble 
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred 
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the 

15 C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in 
coupling, i.e., to cysteine. 

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

20 Measurements of breast cancer polypeptide activity, or of breast cancer or the 

breast cancer phenotype can be performed using a variety of assays. For example, the effects 
of the test compounds upon the function of the breast cancer polypeptides can be measured 
by examining parameters described above. A suitable physiological change that affects 
activity can be used to assess the influence of a test compound on the polypeptides of this 

25 invention. When the functional consequences are determined using intact cells or animals, 
one can also measure a variety of effects such as, in the case of breast cancer associated with 
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional 
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
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messengers such as cGMP. In the assays of the invention, mammalian breast cancer 
polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator 
5 and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELIS A and the like with an antibody that selectively binds to the breast cancer polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 

10 hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the breast cancer 

1 5 protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 
treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 

20 individual genes and gene products (proteins). That is, having identified a particular 

differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially' expressed genes are sometimes referred to herein as "breast cancer proteins." 
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a 

25 fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
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differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
5 expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the breast cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a breast 
cancer protein and a candidate compound, and determining the binding of the compound to 

10 the breast cancer protein. Preferred embodiments utilize the human breast cancer protein, 
although other mammalian proteins may also be used, e.g. for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
breast cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the breast cancer 

15 protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 
made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 

20 suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 

25 with the reagents and overall methods of the invention, maintains the activity of the 

composition and is nondiffiisable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 

30 the protein or agent, excess unbound material is removed by washing. The sample receiving 
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areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
5 support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 

10 protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer protein may be done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 

1 5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 

20 for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand, 

25 etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 
embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 

30 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
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screening. Typically between 0. 1 and 1 hour will be sufficient. Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 



5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the breast cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the 
1 0 label on the support indicates displacement. 



incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the breast cancer protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 
1 5 support, coupled with a lack of competitor binding, may indicate that the test compound is 
capable of binding to the breast cancer protein. 



identity agents that are capable of modulating the activity of the breast cancer proteins. In 
this embodiment, the methods comprise combining a breast cancer protein and a competitor 

20 in a first sample. A second sample comprises a test compound, a breast cancer protein, and a 
competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the breast cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the breast cancer protein. 



bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins. 
The structure of the breast cancer protein may be modeled, and used in rational drug design 
to synthesize agents that interact with that site. Drug candidates that affect the activity of a 



In a preferred embodiment, the competitor is added first, followed by the test 



In an alternative embodiment, the test compound is added first, with 



In a preferred embodiment, the methods comprise differential screening to 



Alternatively, differential screening is used to identify drug candidates that 
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breast cancer protein are also identified by screening drugs for the ability to either enhance or 
reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a breast cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising breast cancer 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate breast cancer agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the breast cancer protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 
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In one embodiment, a method of inhibiting breast cancer cell division is 
provided. The method comprises administration of a breast cancer inhibitor. In another 
embodiment, a method of inhibiting breast cancer is provided. The method comprises 
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells 
5 or individuals with breast cancer are provided. The method comprises administration of a 
breast cancer inhibitor. 

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
1 0 those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
1 5 such as semi-solid or soft agar. The transformed cells, when transfected with tumor 

suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
20 the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 rd ed., 1994), 
herein incorporated by reference. See also, the methods section of Garkavtsev et ai (1996), 
25 supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
30 continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 

78 



WO 02/059377 PCT/US02/02242 



higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
5 supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation 
10 density in non-limiting medium conditions. The percentage of cells labeling with ( 3 H)- 
thymidine is determined autoradiographically. See, Freshney (1994), supra. 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
15 counterparts {see, e.g., Temin, J. Natl Cancer fasti. 37:167-175 (1966); Eagle et al y J. Exp. 
Med. 131:836-879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

20 Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth. 
25 in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol. (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless et al. , J. Biol. Chem. 249:4295-4305 (1974); 
30 Strickland & Beers, J. Biol Chem. 251:5694-5702 (1976); V/hw et al, Br. J. Cancer 42:305- 
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312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, in Biological Responses in Cancer ; pp. 1 78- 1 84 (Mihich (ed.) 1 985); 
Freshney Anticancer Res. 5:111-130 (1985). 

5 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate breast cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
1 0 assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent Penetration into the gel, or through to the distal side of 
1 5 the filter, is rated as invasiveness, and rated histologically by number of cells and distance 

moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra, 

Tumor growth in vivo 

20 Effects of breast cancer-associated sequences on cell growth can be tested in 

transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out 
transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous breast cancer gene site in the mouse genome via homologous recombination. 

25 Such mice can also be made by substituting the endogenous breast cancer gene with a 

mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene, 
e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 

30 is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
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that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion {see, e.g., Capecchi et al, Science 244:1288 (1989)), Chimeric targeted mice can be 
derived according to Hogan et al , Manipulating the Mouse Embryo: A Laboratory Manual, 
5 Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A . 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be use<L For example, genetically athymic "nude" mouse {see, e.g., Giovanella et al, J. 
Natl. Cancer Inst. 52:921 (1974)), a SCDD mouse, a thymectomized mouse, or an irradiated 

10 mouse (see, e.g. t Bradley et al, Br. J. Cancer 38:263 (1978); Selby et al, Br, J. Cancer 
41 :52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10 6 cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
expressing a breast cancer-associated sequences are injected subcutaneously. After a suitable 

1 5 length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two 
largest dimensions) and compared to the control. Tumors that have statistically significant 
reduction (using, e.g., Student's T test) are said to have inhibited growth. 

Polynucleotide modulators of breast cancer 

20 Antisense Polynucleotides 

In certain embodiments, the activity of a breast cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof. 

25 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occuning 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
30 moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
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sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the breast cancer protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
5 means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 

10 binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense 
molecule is for a breast cancer sequences in Tables 1-25, or for a ligand or activator thereof. 
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 

15 generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988 and 
van der Krol et al (BioTechniques 6:958 (1988)). 

20 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an 
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 

25 ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al y Adv. in 

Pharmacology 25: 289-317 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al, 
Nucl Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
30 No. 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g., 
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WO 94/26877; Ojwang et al., Proc. Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et 
al., Human Gene TJierapy 1:39-45 (1994); Leavitt et al.,Proc. Natl. Acad. Sci. USA 92:699- 
703 (1995); Leavitt etal, Human Gene Therapy 5:1151-120 (1994); and Yamada etal. 
Virology 205: 121-126(1994)). 

Polynucleotide modulators of breast cancer may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the Ugand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating breast cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous 
breast cancer protein. Alternatively, the methods comprise administering to a cell or 
organism a recombinant nucleic acid encoding a breast cancer protein. This may be 
accomplished in any number of ways. In a preferred embodiment, e.g. when the breast 
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing 
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by 
overexpressing the endogenous breast cancer gene or administering a gene encoding the 
breast cancer sequence, using known gene-therapy techniques, e.g.. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene 
using enhanced homologous recombination (EHR), e.g. as described in PCT/US93/03868, 
hereby incorporated by reference in its entirety. Alternatively, e.g. when the breast cancer 
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sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is 
decreased, e.g. by the administration of a breast cancer antisense nucleic acid. 

In one embodiment, the breast cancer proteins of the present invention may be 
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly, 
5 the breast cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify breast cancer 
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer 
10 antibodies may be coupled to standard affinity chromatography columns and used to purify 
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the breast cancer protein, 

Methods of identifying variant breast cancer-associated sequences 

1 5 Without being bound by theory, expression of various breast cancer sequences 

is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast 
cancer genes may be determined. In one embodiment, the invention provides methods for 
identifying cells containing variant breast cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous breast cancer genes in a cell. This may be accomplished 

20 using any number of sequencing techniques. In a preferred embodiment, the invention 

provides methods of identifying the breast cancer genotype of an individual, e.g., determining 
all or part of the sequence of at least one breast cancer gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a 
number of tissues or different samples of the same tissue. The method may include 

25 comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene, 
i.e., a wild-type gene. 

The sequence of all or part of the breast cancer gene can then be compared to 
the sequence of a known breast cancer gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 

30 embodiment, the presence of a difference in the sequence between the breast cancer gene of 
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the patient and the known breast cancer gene correlates with a disease state or a propensity 
for a disease state, as outlined herein. 

In a preferred embodiment, the breast cancer genes are used as probes to 
determine the number of copies of the breast cancer gene in the genome. 

In another preferred embodiment, the breast cancer genes are used as probes to 
determine the chromosomal localization of the breast cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the breast 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel et al., Pharmaceutical Dosage Forms and Drug Delivery; 
Liebennan, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of 
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is 
known in the art, adjustments for breast cancer degradation, systemic versus localized 
delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction and the severity of the condition may be 
necessary, and will be ascertainable with routine experimentation by those skilled in the art. 
U.S. Patent Application N. 09/687,576, further discloses the use of compositions and 
methods of diagnosis and treatment in breast cancer is hereby expressly incorporated by 
reference. 

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 
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The administration of the breast cancer proteins and modulators thereof of the 
present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
5 some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins 
and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a breast 
cancer protein in a form suitable for administration to a patient. In the preferred embodiment, 
the pharmaceutical compositions are in a water soluble form, such as being present as 

10 pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 

15 propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, pi-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 

20 manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 

25 tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 
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The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a. 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a breast cancer 
protein modulator dissolved in a pharmaceutical^ acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 
the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington 's Pharmaceutical Science (15th ed., 1980) and Goodman & Gillman, 
The Pharmacologial Basis of Therapeutics (Hardman et a/.,eds., 1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gillman, The 
Pharmacologial Basis of Therapeutics, supra. 
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The compositions containing modulators of breast cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
5 amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 

10 effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 

15 treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 

recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 

It will be appreciated that the present breast cancer protein-modulating 
compounds can be administered alone or in combination with additional breast cancer 

20 modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
25 methods, reagents, vectors, and cells useful for expression of breast cancer-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
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of calcium phosphate transfection, spheroplasts, eiectroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
5 volume 152 (Berger), Ausubel et al , eds., Current Protocols (supplemented through 1999), 
and Sambrook et al, Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 

In a preferred embodiment, breast cancer proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory 

1 0 sequences of the breast cancer coding regions) can be administered in a gene therapy 

application. These breast cancer genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

Breast cancer polypeptides and polynucleotides can also be administered as 

15 vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 

compositions can include, e.g., lipidated peptides {see, e.g.,Vitiello, A. et al, 1 Clin. Invest. 
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") 
microspheres (see, e.g., Eldridge, et al, Molec. Immunol 28:287-294, (1991); Alonso et al, 
Vaccine 12:299-306 (1994); Jones et al, Vaccine 13:675-681 (1995)), peptide compositions 

20 contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al, Nature 
344:873-875 (1990); Hu et al, Clin Exp Immunol 1 13:235-243 (1998)), multiple antigen 
peptide systems (MAPs) (see, e.g., Tarn, Proc. Natl Acad. ScL U.S.A. 85:5409-5413 (1988); 
Tam, J. Immunol Methods 196:17-32 (1996)), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 

25 vectors (Perkus, et al, In: Concepts in vaccine development (Kaufinann, ed., p. 379, 1996); 
Chakrabarti, et al, Nature 320:535 (1986); Hu et al, Nature 320:537 (1986); Kieny, et al, 
AIDS Bio/Technology 4:790 (1986); Top et al, J. Infect. Dis. 124:148 (1971); Chanda et al, 
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al, J. 
Immunol Methods. 192:25 (1996); Eldridge etal, Sem. Hematol. 30:16 (1993); Falo et al, 

30 Nature Med. 7:649 (1995)), adjuvants (Warren et al, Annu. Rev. Immunol 4:369 (1986); 
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Gupta et aL % Vaccine 11:293 (1993)), liposomes (Reddy et ai, J. Immunol 148:1585 (1992); 
Rock, Immunol. Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et ai, 
Science 259:1745 (1993); Robinson et al, Vaccine 11:957 (1993); Shiver et al, In: Concepts 
in vaccine development (Kaufmann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev. 
5 Immunol. 12:923 (1994) and Eldridge et al y Sem. Hematol. 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 



substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
10 or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
15 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

20 Vaccines can be administered as nucleic acid compositions wherein DNA or 

RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et. ai, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 

25 include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
{see, e.g., U.S. Patent No. 5,922,687). 



invention can be expressed by viral or bacterial vectors. Examples of expression vectors 
30 include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 



Vaccine compositions often include adjuvants. Many adjuvants contain a 



For therapeutic or prophylactic immunization purposes, the peptides of the 
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vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et aL, Nature 351:456-460 (1991). A wide variety of other vectors useful 
for therapeutic administration or immunization e.g. adeno and adeno-associated virus vectors, 
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, 
will be apparent to those skilled in the art from the description herein {see, e.g., Shata et aL, 
Mol Med Today 6:66-71 (2000); Shedlock et aL, JLeukoc Biol 68:793-806 (2000); Hipp et 
aL, In Vivo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a breast cancer gene or portion of a breast cancer gene under the control of a 
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient. 
The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins, 
but more preferably encodes portions of the breast cancer proteins including peptides derived 
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For 
example, breast cancer-associated genes or sequence encoding subfragments of a breast 
cancer protein are introduced into expression vectors and tested for their immunogenicity in 
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This 
procedure provides for production of cytotoxic T cell responses against cells which present 
antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment breast cancer genes find use in generating 
animal models of breast cancer. When the breast cancer gene identified is repressed or 
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
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to the breast cancer gene will also diminish or repress expression of the gene. Animal models 
of breast cancer find use in screening for modulators of abreast cancer-associated sequence 
or modulators of breast cancer. Similarly, transgenic animal technology including gene 
knockout technology, e.g. as a result of homologous recombination with an appropriate gene 
5 targeting vector, will result in the absence or increased expression of the breast cancer 
protein. When desired, tissue-specific expression or knockout of the breast cancer protein 
may be necessary. 

It is also possible that the breast cancer protein is overexpressed in breast 
cancer. As such, transgenic animals can be generated that overexpress the breast cancer 
10 protein. Depending on the desired expression level, promoters of various strengths can be 
employed to express the transgene. Also, the number of copies of the integrated transgene 
can be determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of breast cancer and are 
additionally useful in screening for modulators to treat breast cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic 

20 acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 

ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecules 
inhibitors of breast cancer-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 

25. (i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
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like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
breast cancer-associated sequences. Such kits can be prepared from readily available 
materials and reagents. For example, such kits can comprise one or more of the following 
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and 
instructions for testing breast cancer-associated activity. Optionally, the kit contains 
biologically active breast cancer protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 
parameters in disease which may be identified in historical or outcome data. 



EXAMPLES 

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying t otal RNA from tissue sample using TRJzol Reagent 

The sample weight is first estimated. The tissue samples are homogenized in 
1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of 
the generator/probe used depends upon the sample amount. A generator that is too large for 
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A 
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g. Fill 
tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than 
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization. 

Tissues should be kept frozen until homogenized. The TRIzol is added 
directly to the frozen tissue before homogenization. Following homogenization, the insoluble 
material is removed from the homogenate by centrifiigation at 7500 x g for 15 min. in a 

Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared 
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homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to 

-70°C for at least one month or else continue with the purification. 

The next process is phase separation. The homogenized samples are incubated 
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1ml of TRIzol reagent is 
5 added to the homogenization mixture. The tubes are securely capped and shaken vigorously 
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp, for 
2-3 minutes and next centrifiiged at 6500 rpm in a Sorvall superspeed for 30 min. at 4oC. 

The next process is RNA Precipitation. The aqueous phase is transferred to a 
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then 
10 0.5 ml of isopropyl alcohol is added per 1ml of TRIzol reagent used in the original 

homogenization. Then, the tubes are securely capped and inverted to mix. The samples are 
then incubated at room temp, for 10 minutes an centrifiiged at 6500 rpm in Sorvall for 20 

min. at4°C. 

The RNA is then washed. The supernatant is poured off and the pellet washed 
1 5 with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in 
the initial homogenization. The tubes are capped securely and inverted several times to 
loosen pellet without vortexing . They are next centrifiiged at <8000 rpm (<7500 x g) for 5 
minutes at 4°C. 

The RNA wash is decanted. The pellet is carefully transferred to an 
20 Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it 
in if necessary). Tube(s) sizes for precipitating the RNA depending on the working volumes. 
Larger tubes may take too long to dry. Dry pellet. The RNA is then resuspended in an 
appropriate volume (e.g., 2 -5 ug/ul) of DEPC H 2 0. The absorbance is then measured. 

The poly A+ mRNA may next be purified from total RNA by other methods 
25 such as Qiagen' s RNeasy kit. The poly A + mRNA is purified from total RNA by adding the 
oligotex suspension which has been heated to 37°C and mixing prior to adding to RNA. 
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x 
Binding Buffer at 65°C. The the total RNA is mixed with DEPC-treated water, 2 x Binding 
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Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next 
incubated for 3 minutes at 65°C and 10 minutes at room temperature. 

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably, 
at a "soft setting," The supernatant is removed without disturbing Oligotex pellet. A little bit 
of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until 
satisfactory binding and elution of poly A + mRNA has been found. 

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted 
onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute. 

Next, the spin column is transferred to a new collection tube and gently 
resuspended in Wash Buffer O W2 and centrifuged as described herein. 

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul 
of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up 
and down. The centrifugation is repeated as above and the elution repeated with fresh elution 
buffer or first eluate to keep the elution volume low. 

The absorbance is next read to determine the yield, using diluted Elution • 
Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA is precipitated before 
proceeding with cDNA synthesis, as components leftover or in the Elution Buffer from the 
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA. 
0.4 vol. of 7.5 M NH40Ac + 2.5 vol. of cold 100% ethanol is added and the preparation 
precipitated at -20°C 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at 
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80% 
ethanol (-20°C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature. 
The80% ethanol wash is then repeated The last bit of ethanol from the pellet is then dried 
without use of a speed vacuum and the pellet is then resuspended in DEPC H 2 0 at lug/ul 
concentration. 

Alternatively the RNA may be purified using other methods (e.g.. Oiagen's RNeasv kit) 
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No more than 100 ug is added to the KNeasy column. The sample volume is 
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol 
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an 
RNeasy mini spin column for centrifiigation (15 sec at >1 0,000 rpm). If yield is low, reapply 
5 the flowthrough to the column and centrifuge again. 

Then, transfer column to a new 2 ml collection tube and add 500 ul Buffer 
RPE and centrifuge for 15 sec at >10,000 ipm. The flowthrough is discarded. 500 ul Buffer 
RPE and is then added and the preparation is centriuged for 15 sec at >10,000rpm. The 
flowthrough is discarded, and the column membrane dried by centrifuging for 2 min at 
10 maximum speed. The column is transferred to a new 1.5-ml collection tube. 30-50 ul of 

RNase-free water is applied directly onto column membrane. The column is then centrifliged 
for 1 min at > 10,000 rpm and the elution step repeated. 

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol. 

15 

First Strand cDNA Synthesis 

The first strand can be make using using Gibco's "Superscript Choice System 
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA+ 
mRNAL For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of 

20 Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA 
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol 
T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4ul 5X 1 st Strand 
Buffer, 2 ul of 0. 1M DTT, and 1 ul of lOmM dNTP mix. The preparation is then incubated at 
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1 

25 hour. 

Second Strand Synthesis 

For the second strand synthesis, place 1st strand reactions on ice and add: 91 

ul DEPC H 2 0; 30 ul 5X 2nd Strand Buffer; 3 ul lOmM dNTP mix; 1 ul 10 U/ul E.coli DNA 

30 Ligase; 4 ul 10 U/ul E.coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2 

96 



• 



WO 02/059377 PCT/US02/02242 



hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add 10ulof0.5M 
EDTA. 



Cleaning up cDNA 

5 The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24: 1) 

and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed. 
The cDNA mix is then transferred to PLG tube. An equal volume of 
phenol:chloroform:isamyl alcohol is then added, the preparation shaken vigorously (no 
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is 
10 transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH40Ac and 2.5X 
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20 
min, maximum speed. The supernatant is removed, and the pellet washed with 2X with cold 
80% ethanol. As much ethanol wash as possible should be removed before air drying the 
pellet; and resuspending it in 3 ul RNase-free water. 

15 

In vitro Transcription (TVTD and labeling with biotin 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1 .5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul 
T7 lOxATP (75 mM) (Ambion); 2 ul T7 lOxGTP (75 mM) (Ambion); 1.5 ul T7 lOxCTP (75 

20 mM) (Ambion); 1.5 ul T7 lOxUTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-ll-UTP 

(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul lOx T7 
transcription buffer (Ambion); and 2 ul lOx T7 enzyme mix (Ambion). The final volume is 
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol 

25 handbook. The cRNA often needs to be ethanol precipitated by resuspension in a volume 
compatible with the fragmentation step. 

Fragmentation is performed as follows. 15 ug of labeled RNA is usually 
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 

30 the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
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RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 
buffer is 200 niM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
5 of the transcript size range 

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50 ng/ul final cone); 50 pM 948-b control 
10 oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA; 
0.5 mg/ml acetylated BSA; and 300 ul with lxMES hyb buffer. 

The hybridization reaction is conducted with non-biotinylated IVT (purified 
by RNeasy columns) (see example 1 for steps from tissue to IVT): The following mixture is 
prepared: 

1 5 IVT antisense RNA; 4 jig: (il 

Random Hexamers (1 |ig/^il): 4 \xl 
H 2 0: 

14 |al 

Incubate the above 14 ^1 mixture at 70°C for 10 min.; then put on ice. 
20 The Reverse transcription procedure uses the following mixture: 

0.1MDTT: 3^1 

50XdNTPmix: 0.6^1 

H 2 0: 2.4 Ktl 

Cy3orCy5dUTP(lmM): 3 fil 
25 SSRTII(BRL): 1 jal 



16 ul 

The above solution is added to the hybridization reaction and incubated for 30 min., 42°C. 

Then, 1 jil SSII is added and incubated for another hour before being placed on ice. 
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The SOX dNTP mix contains 25mM of cold dATP, dCTP, and dGTP, lOmM 
of dTTP and is made by adding 25 pi each of lOOmM dATP, dCTP, and dGTP; 10 pi of 
lOOmMdTTP to 15plH 2 0.] 

RNA degradation is performed as follows. Add 86 pi H 2 0, 1.5 pl 1M NaOH/ 
2 raM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 pl TE/sample spin at 7000 g 
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500 pl buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min. 
Incubate at 5 min 95°C to denature the DNAse. 

Sample preparation 

For sample preparation, add Cot-1 DNA, 10 ul; 50X dNTPs, 1 ul; 20X SSC, 2.3 pi; Na pyro 
phosphate, 7.5 |xl; 10 mg/ml Herring sperm DNA; 1 ul of 1/10 dilution to 21 .8 final vol. Dry 
in speed vac. Resuspend in 15 pi H 2 0. Add 0.38 pi 10% SDS. Heat 95°C, 2 min and slow 
cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. Washing after 
the hybridization: 3X SSC/0.03% SDS: 2 mia, 37.5 ml 20X SSC+0.75ml 10% SDS in 
250ml H 2 0; IX SSC: 5 min., 12.5 ml 20X SSC in 250ml H 2 0; 0.2X SSC: 5 min., 2.5 ml 20X 
SSC in 250ml H 2 0. Dry slides and scan at appropiate PMT's and channels. 
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TABLE 1: Figure 1 from BRCA 001 US 

5 Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



10 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unigene number 



15 R1: 



Unigene Title: Unigene gene title 



Ratio of normal breast tissue to tumor 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey 


ExAccn 


100472 


090084 


100499 


T51986 


100545 


M55405 


100549 


BE142019 


100613 


X52078 


100635 


BE259039 


100645 


X16841 


100654 


A03758 


100702 


L27065 


100915 


M60832 


100971 


BE379727 


101125 


AJ250562 


101166 


M90424 


101164 


NM 001674 


101336 


NM 006732 


101367 


X03350 


101447 


M21305 


101461 


N98569 


101511 


M27826 


101634 


AV650262 


101736 


M74447 


102208 


U22961 


102297 


NIVL0015O4 


102450 


U48251 


102515 


U89337 


102571 


U60115 


102800 


AA313538 


102857 


NM 006744 


102990 


AA829286 


103434 


X98085 


103747 


AA081995 


103750 


AA126129 


103812 


AA137107 


103851 


AA326216 


104080 


AB041036 


104093 


R50727 


104106 


AA422123 


104109 


AL353957 


104250 


F06638 


104340 


AA426189 


104492 


N73185 


104506 


N91071 


104511 


N99542 


104532 


AI498763 



UnigeneiD UnigeneTHIe R1 

Hs.1023 pyruvate dehydrogenase (lipoamide) alpha 5 

Hs.283108 hemoglobin, gamma G 10 

gb:Homo sapiens mucin (MUC-3) mRNA, part 5 

Hs.222056 Homo sapiens cDNA FU1 1572 fis, done HE 10 

Hs.101047 transcription factor 3 (E2A immunoglobul 5 

Hs.129953 Ewing sarcoma breakpoint region 1 5 

Hs.167988 neural ceil adhesion molecule 1 5 

NM_000477*:Homo sapiens albumin (ALB), m 1 0 

gb:Human neurofibromatosis 2 (NF2) mRNA, 5 

Hs.249239 collagen, type VIII, alpha 2 5 

Hs.83213 fatty acid binding protein 4, adipocyte 10 

Hs.82749 transmembrane 4 superfamay member 2 5 

Hs.2099 tipocalin 1 (protein migrating faster th 5 

Hs.460 activating transcription factor 3 10 

Hs.75678 FBJ murine osteosarcoma viral oncogene h 1 0 

Hs.4 alcohol dehydrogenase 18 (class I), beta 10 

gbrHuman alpha satellite and satellite 3 1 0 

Hs.76422 phosphofipase A2, group IIA (platelets, 1 0 

Hs.267319 endogenous retroviral protease 10 

Hs.75765 GR02 oncogene 5 

Hs.502 transporter 2, ATP-binding cassette, sub 1 0 

gb:Human mRNA clone with similarity to L 10 

Hs.1 98252 G protein-coupled receptor 9 5 

Hs.75871 protein kinase C binding protein 1 1 0 

Hs.169886 tenascinXB 10 

Hs.239069 four and a half UM domains 1 5 

gb:EST185419 Colon carcinoma (HCC) cell 10 

Hs.76461 retinol-binding protein 4. interstitial 10 

Hs.332053 serum amyloid A1 10 

Hs.54433 tenasdn R (restrictin, janusin) 5 

gb:zn26d06.ii Stratagene neuroepithelium 1 0 

gb:zm78c07.rl Stratagene neuroepithelium 5 

Hs.326391 Homo sapiens, clone MGC:16638, mRNA, com 10 

Hs.8719 hypothetical protein MGC1136 5 

Hs.57771 kalfikrein 11 (KLK11; TLSP; PRSS20; hipp 5 

Hs.336970 ESTs 10 

gb:zv26h12.r1 SoaresJJhHMPu S1 Homosapi 5 

Hs.284181 hypothetical protein DKFZp434P0531 10 

Hs.1 2440 Homo sapiens done 24734 mRNA sequence 10 

gb:zw1 1 e09.r1 Soares JJhHMPu S1 Homo sapl 5 

Hs.94285 EST 10 

Hs.109650 ESTs 10 

Hs.572 orosomucold 1 5 

Hs.203013 hypothetical protein FU12748 10 
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104536 R24024 Hs.158101 Homo sapiens cDNA FU14673 fts, done NT 5 

104572 Y11312 Hs.132463 phosphoinosilide^-kinase. class 2, beta 5 

104659 AW969769 Hs.105201 ESTs 5 

104677 AA009764 Hs.190380 ESTs 10 

5 104711 AA017245 Hs.32794 ESTs 10 

104731 AA019300 Hs. 125070 ESTs, Moderately similar to I54374 gene 10 

104764 AI039243 Hs.278585 ESTs 5 

105005 A129B208 Hs.28805 ESTs 10 

105036 AA130390 Hs.25549 hypofoetlcal protein FU20898 10 

10 105105 R61532 Hs.87016 hypothetical protein RJ2293B 5 

105231 AW970043 Hs.238039 hypothetical protein RJ 11 090 5 

105239 AA221036 gb:zri)3f12ji Stratagene NT2 neuronal pr 10 

105921 AA421973 Hs.1691 19 ESTs, Weakly similar to T25731 hypotheti 5 

105957 BE242857 Hs.27021 hypothetical protein FU11159 5 

15 106052 N79885 Hs.6382 ESTs, Highly similar to T00391 hypotheti 10 

106119 AL359624 Hs.11387 KIAA1 453 protein 5 

106181 A1803651 Hs.191608 ESTs 10 

106194 AW976171 Hs.286194 hypothetical protein RJ22233 5 

106283 A1085846 Hs.25522 KIAA1 808 protein 10 

20 106379 AL042069 Hs.1 19021 DKFZP434N061 protein 10 

106451 AW235928 Hs.313182 ESTs 10 

106491 AA135688 Hs.10083 Homo sapiens, clone IMAGE :4 139786, mRNA, 10 

106700 AA906434 Hs.3776 zinc finger protein 216 5 

106782 AW054886 Hs.25682 Homo sapiens mRNA tor KIAA1 863 protein, 10 

25 106851 AI458623 gb:tk04g09jc1 NCI_CGAP_Lu24 Homo sapiens 5 

106870 AI983730 Hs.26530 serum deprivation response (phosphatidyl 5 

106892 AI347578 Hs.124015 hypothetical protein MGC2605 5 

106954 AF128847 Hs.204038 hdoletjiylamineN-meftyllransferase 5 

106991 AJ223811 Hs.30127 hypothetical protein 5 

30 107103 AI446183 Hs.9572 ESTs, Highly similar to CYA5 HUMAN ADENY 5 

107124 AB006532 Hs.31442 RecQ protein -like 4 10 

107148 AI005036 Hs.334305 GS1999fuU 10 

107214 AF127026 Hs.5394 myosin IA 10 

107242 AB020672 Hs.175411 K1AA0865 protein 10 

35 107331 AI905985 Hs.111805 ESTs 10 

107351 U51704 Hs.323428 ESTs, Moderately similar to ALU8 HUMAN A 5 

107423 W26652 Hs.6163 PTEN induced putative kinase 1 5 

107447 W28516 Hs.19210 hypothetical protein MGC1 1308 10 

107451 AL042425 Hs.283976 hypthetical protein PR02389 10 

40 107453 AI092790 Hs.334703 hypothetical protein FLJ14529 5 

107459 W38002 Empirically selected from AFFX single pr 10 

107683 N53167 Hs.47623 ESTs 10 

107711 W96141 Hs.220687 ESTs 10 

107754 AA017462 Hs.269244 ESTs 10 

45 107757 BE621721 Hs.280792 hypothetical protein FU 12387 similar to 10 

107864 AA025060 Hs.61246 ESTs 10 

107872 BE271708 Hs.95110 ESTs, Weakly similar to A55943 1-phospha 10 

107888 AA025836 Hs.191637 ESTs 5 

107997 AL049176 Hs.82223 chordin-fike 10 

50 108056 AA043675 Hs.62633 ESTs 10 

108081 AA093668 Hs.28578 muscleblind (Drosophila)-like 5 

108113 AA012881 Hs.72531 hypothetical protein FU 11838 10 

108238 AA059473 Hs.66783 EST 10 

108257 AA677927 Hs.144269 ESTs 5 

55 108335 AA070500 gbzm70h03.s1 Stratagene neuroepithelkim 5 

108351 AA071193 gb:zf79b12s1 Soares_pineaUland^N3HPG 10 

108382 NMJ06770 Hs.67726 macrophage receptor with collagenous str 5 

108392 AA075124 gb:zmB6a01.s1 Stratagene ovarian cancer 10 

108441 AA079079 gb2m97c09^1 Stratagene colon HT29 (937 10 

60 108446 AA085383 gbzn13g03.s1 Stratagene hNT neuron (937 10 

108497 AA074897 gb2m85a05.r1 Stratagene ovarian cancer 10 

108604 AA934589 Hs.49696 ESTs 5 

108662 AF1 17646 Hs.156637 Cas-Br-M (murine) ectropic retroviral tr 5 

108706 AA121820 Hs.74569 KIAA0842 protein 10 

65 108738 AA126583 Hs.158725 ESTs 10 

108827 AI273692 Hs.1 10470 ESTs 10 

109123 AI028376 Hs.73232 ESTs 10 
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109389 AA101325 Hs.86154 hypothetical protein FU 1 2457 10 

109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA OKFZp762G123 (fr 5 

109919 R40604 Hs.129539 ESTs, Weakly similar to MCAT_HUMAN MITOC 10 

110006 AI094674 Hs.30524 ring finger protein 24 10 

5 110141 H46749 Hs.31540 ESTs 10 

110354 W22165 Hs.22586 ESTs 5 

110433 AW294162 Hs.301062 UDP-N-acetyJ-atpha-0-galactosamine:polyp 10 

110448 H51276 Hs.13526 hypothetical protein FU 12688 10 

110455 H52576 gb:yt85e08.r1 Soares_pineal_gtand N3HPG 5 

10 110540 H72639 Hs.167608 ESTs 5 

110553 H60593 Hs.124990 ESTs 10 

110976 AL044174 Hs.159526 patched (Drosophila) homolog 10 

110987 AI753316 Hs.26034 ESTs 5 

111158 N66616 Hs. 1 38629 Ksapiens mRNA for subtelomeric repeats 5 

15 111168 A1798376 gb:tr34b07jc1 NCLCGAPJ)v23 Homo sapiens 10 

111187 AJ224864 Hs.9688 leukocyte membrane antigen 5 

111307 AA641636 Hs.37477 ESTs, Weakly similar to T46903 hypothec" 5 

111400 R00144 Hs.189771 ESTs 10 

111498 AI168511 gb:ow90h09.s1 Soares_fetaLliver_spleen_ 10 

20 111651 R16733 Hs.20499 ESTs 10 

111738 R26065 gb:yh39d03.s1 Soares placenta Nb2HP Homo 5 

111803 AA593731 Hs.325823 ESTs, Moderately similar to ALUS HUMANA 10 

111995 R42333 Hs.302292 ESTs 10 

112071 AH 17490 Hs.47225 Ras-assoctated protein Rap 1 10 

25 112204 NMJ06668 Hs.25121 cytochrome P450, subfamily 46 (cholester 10 

112258 R51889 Hs.24990 ESTs 5 

112490 R31094 Hs.24378 ESTs 10 

1 12588 R77302 gb:yi75h08.s1 Soares placenta Nb2HP Homo 10 

112654 BE618629 Hs.268809 ESTs 5 

30 112784 T98628 Hs.191290 ESTs 5 

112817 AI057205 Hs.14584 ESTs 5 

112885 AA581428 Hs.5021 EST 10 

112913 T16837 Hs.4241 ESTs 5 

113149 T51588 gb^b27e06.s1 Stratagene fetal spleen (9 10 

35 113174 T54659 Hs.301755 Homo sapiens cDNA FU 11465 fis, done HE 5 

113203 AA743563 Hs.10305 ESTs 5 

113299 AW207424 Hs.332594 ESTs 10 

113367 N92359 Hs.14518 ESTs, Moderately similar to A48752 B-cel 10 

113457 R16763 Hs.268679 ESTs 5 

40 1 13563 AA913635 Hs.326413 Homo sapiens cONA FU20812 fis, clone AD 10 

113574 R06874 Hs.268628 ESTs, Moderately similar to ALU1_HUMAN A 5 

113776 AI791905 Hs.95549 hypothetical protein 10 

113790 AI244311 Hs.26912 ESTs 10 

113807 W07586 Hs.8045 ESTs 3 

45 113958 W86195 gb:zh54e05.s1 SoaresJeraUver_spleen_ • 10 

114211 Z39319 Hs.27347 EST 10 

114254 AB018263 Hs.180338 tumor necrosis factor receptor supermini 5 

114349 AA745978 Hs.28273 ESTs 5 

114449 AA020736 gb:ze63b11.s1 Soares retina N2MHR Homo 5 

50 114484 AA034378 Hs.267319 endogenous retroviral protease 5 

114576 AA065096 gb2m50a02.s1 Stratagene fibroblast (937 5 

1 14624 AA081507 gbznQ5b10.r1 Stratagene hNT neuron (937 5 

114844 AA234826 Hs.87386 EST 5 

114906 AA234462 Hs.87350 ESTs 5 

55 115624 AK000725 Hs.50579 hypothetical protein FU2071 8 3 

115666 AF173081 Hs.178215 Vertebrate LIN7 homolog 1, Tax interadi 5 

115712 AB020649 Hs.74569 KIAA0842 protein 5 

115889 AA398841 Hs.39850 hypothetical protein FU205 17 10 

115949 AI478427 Hs.43125 esophageal cancer related gene 4 protein 10 

60 116107 AL133916 Hs. 172572 hypothetical protein FU20093 10 

116180 AA463902 Hs.13522 ESTs, Weakly similar to 138022 hypotheti 5 

116267 AW968703 Hs.30085 hypothetical protein FU23186 5 

116291 AW4 10377 Hs.41502 hypothetical protein FU21 276 5 

116527 AW194253 Hs.68607 ESTs 10 

65 116659 BE314852 Hs. 168694 Homo sapiens clone 23763 unknown mRNA, p 5 

116708 F10528 Hs.70001 ESTs. Moderately similar to JC61 69 nude 5 

117058 AW801806 gb:IL5-UM0070-1 10400-062^07 UM0070 Homo 5 
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117151 AIB03656 
117226 N20468 
117323 AI472853 
117571 N34417 
117624 N26627 
117673 N40551 
117847 N49265 
117877 AW263476 
117919 BE222341 
118049 N53145 
118413 AW955696 
118613 AW78236 
118664 N70907 
118858 AL122040 
118902 AA993527 
119039 A1160570 
119159 AF142419 
119216 AA514422 
119317 AK002001 
119366 T77892 
119378 T81824 
119528 W38051 
119792 AL049798 
119800 AF086332 
119817 AF088061 
119835 AF086429 
119923 AW803308 
119961 U34249 
120379 AL042725 
120931 AW136934 
121037 AA907743 
121282 AA401695 
121382 AA405763 
121764 AA421452 
122034 AK000229 
122441 AA447555 
122756 AA458945 
122771 AW135093 
123601 AA609122 
123623 AI024595 
123941 AA621529 
124215 H62570 
124276 H83465 
124680 AK001527 
125099 NM.014312 
125121 T98199 
125188 BE299567 
125284 NM002666 
125906 BE256206 
128484 AA485421 
128511 NR.002250 
128538 R44214 
128606 C16161 
128850 AA193106 
128870 H39537 
128903 AW150717 
128931 N62889 
129001 AA443323 
129091 AA056483 
129101 NM_013403 
129146 AL117472 
129213 A1146494 
129228 U40714 
129265 AA530892 
129285 8E617015 
129346 AF110141 
129368 NM.003877 



Hs.42373 ESTs 



Hs.43387 

Hs.44584 

Hs^2364 

Hs.184043 

Hs.182391 

Hs.44268 

Hs.279472 

Hs.90960 

Hs.49688 

Hs.230619 

Hs.102981 

Hs.293907 

Hs.252097 

Hs.15020 

Hs.221849 

Hs.51305 

Hs.90949 

Hs.80552 

Hs:58314 

Hs.159690 

Hs.58429 

Hs.62954 

Hs.337461 

Hs.97162 

Hs.142373 

Hs.97334 

Hs.111939 

Hs.164851 

Hs.98017 

Hs.99116 

Hs.95898 

Hs.97282 

Hs.1 12645 

Hs.97508 



Hs.163953 

Hs.1 12377 

Hs.48403 

Hs.271749 

Hs.103253 

Hs.17775 

Hs.270503 

Hs.10082 

Hs.101189 

Hs.283040 

Hs.180817 

Hs.75309 

Hs.296176 

Hs.107242 

Hs.107812 

Hs.301463 

Hs.108665 

Hs.108924 

Hs.109525 

Hs.239307 

Hs.171695 

Hs.1 1006 

Hs.288908 

Hs.1 10776 



gb:yx39b10.s1 Scares melanocyte 2NbHM Ho 

ESTs 

ESTs 

ESTs, Weakly similar to JC4124 pregnancy 
Homo sapiens Ets-1 binding protein (E1B) 
ESTs 

myelin gene expression factor 2 
ESTs 

gb:yv55fl)9.s1 Soares fetal liver spleen 

ESTs 

ESTs 

EST 

Homo sapiens mRNA; cDNA DKFZp434G1972 (f 
hypothetical proteii FU23403 
pregnancy specific beta-1-glycoproteln 6 
homolog of mouse quaking QKI (KH domain 
ESTs 

v-maf muscubaponeurotic fibrosarcoma (a 
gbryd20ffl4^1 Soares fetal fiver spleen 
EST 

Empirically selected from AFFX single pr 

dermatopontin 

ESTs 

ESTs 

ESTs 

ferritin, heavy polypeptide 1 

Human putative zinc finger protein (ZNFB 

gb;DKFZp434B1822_r1 434 (synonym: htes3) 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FLJ20470 fis, clone KA 
ESTs, WeaWy similar to K1AA0926 protein 
Homo sapiens cDNA FU20222 fis, clone CO 
EST 
ESTs 

ESTs, Highly similar to G100 HUMAN 1 10 K 
Homo sapiens mRNA; cDNA DKFZp434D2472 (I 
a disintegrin and metalloproteinase doma 
gb:af47a02.s1 Soares_totaLfetusJJb2HF8_ 
gb:yr44a01.r1 Soares fetal Ever spleen 
gb.7s91a1U1 Soares retina N2b5HR Homo 
hypothetical protein FU 10665 
cortic al thymocyte receptor (X. laevis 
hypothetical protein FU 10847 
ESTs. Moderately similar to ALU8.HUMAN A 



p75NTR-associated ceD death executor; o 
ESTs, Weakly similar to ALU7_HUMAN ALU S 
potassium intermediate/small conductance 
ESTs 

hypothetical protein PR02543 
chromosome 11 open reading frame 23 
eukaryotic translation elongation factor 
STAT induced STAT inhibitor 3 
Homo sapiens cDNA FU 12965 fis, clone NT 
BPOZ protein 

Human Chromosome 16 BAC done CIT987SK-A 
zinedin 

SH3-<iomain protein 5 (ponsln) 

ESTs. Weakly similar to IRX2_HUMAN IROQU 

tyrosyl-tRNA synthetase 

dual specificity phosphatase 1 

ESTs, Moderately similar to T17372 plasm 

WAS protein family, member 2 

STAT induced STAT inhlbitor-2 
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Table 1 A shows the accession numbers for those pkeys lacking unigenelD's for Table 1. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 
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TABLE 2: Figure 2 from BRCA 001 US 
5 Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 0 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 
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hypothetical protein FU20898 


10 




gbzr03f12.r1 Stratagene NT2 neuronal pr 


10 


Hs.6382 


ESTs, Highly similar to T00391 hypometi 


10 


Hs.191608 


ESTs 


10 


Hs.25522 


KIAA1608 protein 


10 


Hs.119021 


DKFZP434N061 protein 


10 


Hs.313182 


ESTs 


10 


Hs.10083 


Homo sapiens, done IMAGE:4139786, mRNA, 


10 


Hs.25682 


Homo sapiens mRNA for KIAA1863 protein, 


10 


Hs.31442 


RecQ proteln-like4 


10 


Hs.334305 


GS1999fuIl 


10 


Hs.5394 


myosin IA 


10 


Hs.175411 


KIAA0865 protein 


10 


Hs.1 11805 


ESTs 


10 


Hs.19210 


hypothetical protein MGC1 1308 


10 


Hs.283976 


hypothetical protein PR02389 


10 


Hs.95110 


ESTs, Weakly similar to A55943 1-phospha 


10 




gbzf79b12.s1 Soaresj)lneaIjiandJJ3HPG 


10 


Hs.26954 


Homo sapiens mRNA; cDNA DKFZp762G123 (fr 


10 


Hs.301062 


UDP-N-acetyklpha-D-galactosamine:polyp 
patched (DrosophDa) homolog 

108 


10 


Hs.159526 
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111168 AJ793376 
111651 R16733 
111803 AA593731 
114484 AA034378 
125284 NMJJ02666 
128850 AA193106 
128903 AW150717 
129346 AF110141 
129381 AW245805 
129516 AFQ20038 
129554 BE222078 
130085 M62402 
130243 D88435 
130400 V00517 
130436 NM_001928 
130563 BE270472 
130589 AL1 10226 
130683 AA993269 
130689 NM_006691 
130689 AA046747 
130718 N70196 
130798 M81349 
130840 BE048821 
131184 AB040935 
131282 X03350 
131328 AW939251 
131543 AW966881 
131753 AA829286 
131785 H69342 
131828 AJ000263 
132426 AW118072 
132675 AI291496 
132898 W28548 
132905 NMJJ04235 
133120 NM_C03278 
133407 AF017987 
133719 H26904 
134007 AF072441 
134055 086062 
134111 AI372588 
134117 AA081846 
134177 BE243319 
134369 AF207664 
134496 M64936 
134510 NM.002757 
134550 M26315 
134758 NMJW0078 
134963 NM.003394 
135066 X04430 
408790 AW580227 
446674 AA563892 





gb:tr34b07j(1 NCI_CGAP_Ov23 Homo sapiens 


10 


Hs.20499 


ESTs 


10 


Hs.325823 


ESTs, Moderately similar to ALU5_HUMAN A 


10 


Hs.267319 


endogenous retroviral protease 


10 


Hs.103253 


perilipin 


10 


Hs.180817 


chromosome 1 1 open reading frame 23 


5 


Hs.296176 


STAT induced STAT inhfeitor 3 


10 


Hs.288908 


WAS protein tamiry. member 2 


10 


Hs.110903 


daudin 5 (transmembrane protein deleted 


10 


Hs.11223 


bocitrate dehydrogenase 1 (NADP^ solu 


10 


Hs.1 13069 


ESTs 


10 


Hs.274313 


insulin-like growth factor binding prote 


10 


Hs.153227 


cyciin G associated kinase 


10 


Hs.283108 


hemoglobin, gamma G 


10 


Hs.155597 


O component of complement (adipsin) 


10 


Hs.279900 


HSPC015 protein 


10 


Hs.16441 


OKFZP434H204 protein 


10 


Hs.17872 


Homo sapiens, done IMAGE3875012, mRNA 


10 


Hs.17917 


extracellular link domain-containing 1 


10 


Hs.17917 


extracellular link domain-containing 1 


10 


Hs.18376 


K1AA1319 protein 


10 


Hs.1955 


serum amyloid A4, constitutive 


10 


Hs.20144 


small inducible cytokine subfamily A (Cy 


10 


Hs.23954 


cerebral ceil adhesion molecule 


10 


Hs.4 


alcohol dehydrogenase 1B (class I), beta 


10 


Hs.25647 


v-fos FBJ murine osteosarcoma viral onco 


10 


Hs.41639 


programmed cefl death 2 


10 


Hs.332053 


serum amyloid A1 


10 


Hs.26320 


TRABID protein 


10 


Hs.278658 


keratin, hair, basic, 6 (monilethrix) 


10 


Hs.89981 


diacykjrycerol kinase, zeta (104kD) 


10 


Hs.5476 


Homo sapiens, done IMAG&3530123, mRNA, 


10 


Hs.224829 


ESTs 


10 


Hs.7934 


Kruppe Mike factor 4 (gut) 


10 


Hs.65424 


tetranectin (plasmkiogen-binding protein 


10 


Hs.7306 


secreted frizzled-related protein 1 


10 


Hs.75736 


apoiipoprotein O 


10 


Hs.7840 


caidneurin binding protein 1 


10 


Hs.182423 


ES1 (zebrafish) protein, human homolog o 


10 


Hs.8022 


TU3A protein 


5 


Hs7921 


Homo sapiens mRNA; cDNA OKFZp566E183 (fr 


5 


Hs.79672 


KIAA0552geneprodud 


10 


Hs.8230 


a dislntegrin-Qke and metalloprotease ( 


10 




gbflomo sapiens retinoic acid-inducible 


10 


Hs.250870 


rnftogen-activated protein kinase kinase 


10 


Hs.85258 


CD8 antigen, alpha polypeptide (p32) 


5 


Hs.89538 


cholesteryl ester transfer protefo, plas 


5 


Hs.91985 


wingless-type MMTV integration site fami 


10 


Hs.93913 


Interleukin 6 (interferon, beta 2) 


10 


Hs.47860 


neurotrophic tyrosine kinase, receptor, type 2 


10 


Hs.306000 


solute carrier fam2y 4 (anion exchanger), memb 


10 
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TABLE 2A 

Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession 5 * column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



111168 38585.1 AJ798376 S4640Q AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
20 AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 

AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA2065B3 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 
25 103747 117944.1 AA081995 AA101099 

134496 46501.1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41 176 AW594624 BE502415 AA121893 AI269283 T4031 1 

AI684569 AA25701 1 AI079277 A1241318 BE327710 AW975215 AW896268 M884990 BE327514 
105239 34624 J AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 M668403 AA601627 
AW869639 BE061833 BE000620 AW961 170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
3 0 AW821667 AA055720 AW817981 AW856468 AA155719 AA1 79928 T03007 AW754298 AA227407 AA113928 AA307904 

. C16859 

100654 tigr.HT2969 A03758 A06977 A1 5293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494V00495 
X51363 X51364 X51365 

102208 6735.9 U22961 AA203623 AA503337 A1174733 AI192802 C06092 AA035357 AJ190619 AI199244 AI828450 AA602296 A1378195 

3 5 AI20917Q AH86653 A1127795 AI183846 H77389 A1589465 AA629390 H94306 AI01 8388 R68584 AA027196 AJ745413 

AI685092 AKJ93426 AI623873 A1074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 A1333101 
N70806 A1141254 AI383191 AW01237 A1080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AI133207 AM32980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 A11 14729 AI061645 AI064716 AI064959 H77388 

40 T85706 AF075298 AI1 10799 017107 NM.000477 AF190168 R50724 AI248416 AI207432 AI133684 A1133345 AI174710 

AI133290 AI133304 AI174948 AI2074B4 AI110717 AF074624 AI114515 AF063516 A1110642 AI114559 A1114498 AI114759 
AI207568 A1084960 AJ174753 AI1 14666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569T60496 
H94399 AJ133158 T74675 AA484750 T73413 T55909 R50261 772061 N80533 T51 189 T74936 AI207490 AM32925 AI064701 
AI174748 A1114663 A1133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 

45 A1444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091 136 AA334608 T5741 1 Z20979 

N56507 T87485 AH33622 AA343370 T40075 T69671 T53849 T74820 AF075316 AM 10818 T40121 T57381 AI114468 
AA332728T51362AI1 14589 R06691 A1110629 AF063503 AI140543 AA334661 AA332720AA343262 T73513T86549 
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 AI064838 AM 33660 T60398 T88753 T55930 T92126 A1444602 T60996 AH 14792 H9391 1 AI133106 

50 R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064867 AI110730 AF074637 AI207567 

H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 A1065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AU14676 AK)e4778 AA035710 W52763 AI114788 
T83564 AA341859 T81684 T55769 AI1 14710 T51776 AA343213 AI1 14714 T58102 AI110809 R28984 AI174854 AA305675 

5 5 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 

H54053T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI4781 02 AI207662 Ai 192792 AI768421 AIQ64737 AW051713 AA936693 AI133117 A1766232 AI913646 
T83962 AI0651 12 AI207689 AI174684 AI207702 T81475 A] 133325 AI032512 AA701169 AI936354 All 14720 AI433289 
AAQ469&0 A1823482 AH14536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481 

60 AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890T71374 T68294 A1 1 74774 T674 11 T68318 

A1064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI06481 9 AI205B80 T62895 T69430 T951 1 1 AA025050 T73330 
W52657 T71984 T691 18 W92684 AI1 14860 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 T57232 AI336158 

110 
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T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918 
T59166 A1187111 T64308 T62071 T69427 A1114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69334 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 A1312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 011809 D12412 T64300 
T28321 T55864 



102800 14782_20 AA313538 U88895 U88902 
108351 genbank_AA071193 AA071193 
101447 entr&LM21305 M21305 
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TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 



10 



Pkey: 
ExAccn: 
UnigenelD: 
Unigene Title: 
R1: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 



Ratio of normal breast tissue to tumor 



15 



20 



25 



Pkey ExAccn UnigenelD UnigeneTitie 



101336 NM_0O6732 

102208 U22961 

102990 AA829286 

111168 AI798376 

111803 AA593731 

130085 M62402 

130840 BE048821 

131543 AW966881 

133120 NM 003278 

134758 NM.000078 



Hs.75678 

Hs.332053 

Hs.325823 

Hs.274313 

Hs.20144 

Hs.41639 

Hs.65424 

Hs.89538 



R1 

FBJ murine osteosarcoma viral oncogene h 1 0.0 

gb:Human mRNA clone with similarity to L 10.0 

serum amyloid A1 10.0 

gb:tr34b07 jc1 NCLCGAP_0v23 Homo sapiens 10.0 

ESTs, Moderately similar to ALU5.HUMAN A 10.0 

insulin-tike growth factor binding prote 1 0.0 

small inducible cytokine subfamily A (Cy 1 0.0 

programmed cell death 2 1 0.0 

tetranectin (plasminogen-binding protein 10.0 

cholesteryl ester transfer protein, plas 10.0 



112 





WO 02/U59377 



PCT/US02/02242 



TABLE 3A 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 



Table 3 A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

111168 38585J AI798376S46400AW811617AW811616W00557BE142245AW858232AW861851 AW858362 AA232351 AA218567 



AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719 M100472 AA100774 AA130756 AA157705 AA157730 AA157715 M053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 A1829309 AW991957 N66951 AA527374 H56215 
AA045564 A1694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 



1022086735 J U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AH 906 19 AM 99244 A1828450 AA602296 AI378195 



AI209170 AI186653 AI127795 AI183846 H77389 AI589465 M629390 H94306 A1018388 R68584 AA027196 A1745413 AI685092 
AI093426 A1623873 A1074570 N50096 AA047486 N25060 AA327614 AKM2512 AI383957 AA156873 AI333101 N70806 AI141254 
AI383191 AI401237 AI080709 A1093400 W84549 T90806 R00012 W01413 AA630557 A1378348 AI559265 AA877103 W84454 
AA625146 R68379 AM33207 A1132980 AM 33214 A1064826 A1061615 AI133473 AI174852 AI133404 AI133272 V00494 M12523 
M12523 A1207526 AI133120 AKJ64802 A1174993 AJ1 14729 AI061645 A1064718 AI064959 H77388 T85706 AF075298 All 10799 
D17107 NM.000477 AF190168 R50724 AI248416 AI207432 A1133684 AI133345 AI174710 AI133290 A1133304 AJ174948 
A1207484 All 10717 AF074624 A11 14515 AF063516 Al 1 10642 AI114559 AJ114498 AJ1 14759 AI207568 AI064960 AI174753 
AI114666 R69184 R00011 AI064997 T60501 A1207701 T71735 AA385318 H73569 T60496 H94399 AI133158 T74675 AA484750 
T73413 T55909 R50261 T72061 N80533 T51189 T74936 A1207490 AI132925 AI064701 AI174748 A1114663 A1133104 A1132999 
A1133100 AJ054925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 AI444620 T52290 016931 T40012 T48403 
T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T5741 1 220979 N55507 T87485 AI133622 AA343370 T40075 
T69671 T53849 T74820 AF075316 AM 10818 T40121 T57381 AI114468 AA332728 T51362 AI114589 R06691 AI110629 
AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 AI114840 T57284 T39981 T61407 T72757 T51749 
T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AM74786 AI132926 R09237 A1064838 AI133660 T60398 T88753 
T55930 T92126 AI444602 T60996 AH 14792 H9391 1 AI133106 R10779 AI065020 T90925 T50889 D17029 Al 133703 AA333805 
AI133040 AI133017 AI064857 A1110730 AF074637 AI207567 H71080 T73217 AA343950 AI174743 AA334224 AA334281 
R06692 T64739 T40163 T60628 T81661 T73179 R01842 AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 
AI065049 T84512 T55918 AI207595 T39951 AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AH 14676 
AI064778 AA035710 W52763 AH 14786 T83564 AA341859 T81684 T55769 A1114710 T51776 AA343213 All 14714 T58102 
All 10809 R28984 AJ174854 AA305675 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 
AF075308 W86731 T82851 T48269 H54053 T73211 A1114590 T48317 T55965 T74857 R84226 T56552 T52231 T74948 T76976 
R02576 T95666 AI203974 AI189471 AA005147 AI478102 AI207662 A1192792 AI768421 AI064737 AW051713 AA936693 
AI1331 17 AI766232 AI913646 T83962 AI0651 12 A1207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 
AI1 14720 AM33289 AA046980 AI823482 Al 1 14536 AA860651 AW242644 R07469 AW300438 Al 133416 AW271670 AI991363 
T78943 AI823481 AA84551B AA719124 AA883454 T68850 T691 15 AI935509 AI150977 T62890 T71374 T68294 AI174774 
T6741 1 T68318 AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 A1064819 AI205880 T62895 T69430 T951 1 1 
AA025050 T73330 W52657 T71984 T69118 W92684 AI1 14860 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 
T57232 AI336158 T61821 T69457 T62900T62912T72917T46885 AI702448 T57212 T57203R94581 171311 T61819T89358 
T67708 T70918 T59166 AI187111 T64308 T62071 T69427AI1 14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 
T69394 AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 
AI312890 T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 
T64300 T28321T55864 
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TABLE 4: Figure 4 from BRCA 001 US 
5 Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 0 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 



15 Pkey ExAccn UnigenelD Unigene Title 



R1 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



100113 
100114 
100131 
100146 
100147 
100154 
100163 
100220 
100265 
100271 
100275 
100323 
100335 
100364 
100372 
100393 
100400 
100418 
100482 
100518 
100666 
100667 



100678 
100685 



100783 
100850 
100892 
100945 
100969 



100999 
101031 
101045 
101077 
101093 
101161 
101186 
101216 
101228 
101247 
101249 
101332 
101332 
101352 
101396 
101445 
101470 



NrvL001269Hs.64746 
X02308 Hs.82962 
D12485 Hs.11951 
BE185499 Hs.2471 
D13666 Hs.136346 
H60720 Hs.81892 
W44671 Hs.124 
AW015534 Hs.217493 
D38521 Hs.1 12396 
BE160081 Hs.256290 
BE242802 Hs.154797 
D50920 Hs.23106 
AW247529 Hs.6793 
NMJ04341HS.154868 
NM_014791Hs.184339 
084145 Hs.39913 
AW954324 Hs.75790 
D86978 Hs.84790 
M65028 Hs.81361 
NM_004415HsJ4316 
L05424 Hs.169610 
L05424 Hs.169610 
L05424 Hs.169610 
AW502935 Hs.740 
AA328229 Hs.184582 
AA383256 Hs.1657 
AF078847 Hs.191356 
AA836472 Hs.297939 
BE245294 Hs.180789 
AF002225 Hs.180686 
AA157634 Hs.79172 
AK000405 Hs.76480 
H38765 Hs.80706 



J05070 
J05614 
N99692 
L06419 



Hs.151738 

Hs.75227 
Hs.75093 
NMJ06262HS.37Q44 
AA020956 Hs.179881 
AA284166 Hs.84113 
AA333387 Hs.82916 
AA132666 HsJ8802 



chromosome condensation 1 2.3 

thymidylate synthetase 2.9 

ectonucleotide pyrophosphatase/phosphodiesterase 1 1 .9 

KIAA0020 gene product 1.9 

osteoblast specific factor 2 (fascicGn Hike) (periostin) 7.5 

KIAA0101 gene product 9-2 

gene predicted from cDNA with a complete coding sequence 1 .6 

annextnA2 2.0 

KIAA0077 protein 1.5 

S100 calcium-binding protein A1 1 (calgizzarin) 13.5 

KIM0090 protein 5.1 

KIAA0130 gene product 1-9 

platelet-activating factor acetylhydrolase, Isoform lb, gamma subunit (29kD) 2.7 

carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase 2.0 

KIAA0175 gene product 2.6 

novel RGD-containing protein 3.2 

pnosphatidytinositol glycan, class C 1 .5 

KIAA0225 protein 2.0 

heterogeneous nuclear ribonudeoprotein A/B 2.9 

desmoplakin(DP! l DPII) 1.9 

CD44 antigen (homing function and Indian blood group system) ' 5.7 

CD44 antigen (homing function and Indian blood group system) 9.0 

CD44 antigen (homing function and Indian blood group system) 7.6 

PTK2 protein tyrosine kinase 2 53.2 

ribosomal protein L24 1-8 

estrogen receptor 1 16 

general transcription factor HH, polypeptide 2 (44kD subunit) 5.9 

cathepsinB 1.7 

S164 protein 1.7 
ublquitin protein (igase E3A (human papilloma virus E6-associated protein, Angelman syndrome) 1 .5 

solute carrier family 25 {mitochondrial earner; adenine nucleotide translocator), member 5 6.3 

ubiquitin-like 4 11.4 

diaphorase (NADH/NADPH) (cytochrome b-5 reductase) 1.6 

matrix metalioproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase) 8.2 

gbftuman proliferating cell nuclear antigen (PCNA) gene, promoter region. 5.0 

Empirically selected from AFFX single probeset 2.6 
procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehfers-Oanlos syndrome type VI) 1.4 



peripherin 
core-binding factor, beta subunit 

cyctin-dependent kinase inhibitor 3 (CDK2-assoclated dual specificity phosphatase) 
chaperonin containing TCP1, subunit 6A (zeta 1) 



L18964 
J04088 
J04088 
AW94299 
BE267931 
M21259 

NIVLO00546HS.1846 



Hs.1904 
Hs.156346 
Hs.156346 
Hs.16297 
Hs.78996 



protein kinase C, iota 

topoisomerase (DNA) 11 alpha (170kD) 

topoisomerase (DNA) II alpha (170kD) 

C0X17 (yeast) homolog, cytochrome c oxidase assembly protein 

proliferating cell nuclear antigen 

gfcHuman Alu repeats In the region 5' to the small nuclear rib 

tumor protein p53 (U-Fraumenl syndrome) 
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101478 NM_002890Hs.758 RAS p21 protein activator (GTPase activating protein) 1 2.5 

101483 M24486 Hs.76768 procotlagen-protine, 2-oxog)utarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide I 5.5 

101540 J04977 Hs.84981 X-ray repair complementing defective repair in Chinese hamster cells 5 (double-strand-break rejoining 2.1 

101573 AW248421 Hs.250758 proteasoma (prosome, macropain) 26S subunit. ATPase. 3 1.6 

5 101580 NM_012151Hs.83363 coagulation factor Vlll-associated (introntc transcript) 5.7 

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 1.8 

101592 AF064853 Hs.91299 guanine nucleotide binding protein {G pr 5.6 

101621 BE391804 Hs.62661 guanylate binding protein 1, interferon-inducible, 67kD 2.4 

101702 AW504089 Hs.179574 protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha tsoform 1.3 

10 101734 M74099 Hs.1 47049 cut (DrosophflaJ-Cke 1 (CCAAT displacement protein) 2.1 

101759 M8G244 Hs.184601 solute carrier family 7 (cationic amino acid transporter, y+ system), member 5 5.0 

101767 M81057 Hs. 1 80884 carboxypeptidase B1 (tissue) 14.4 

101782 AA306495 Hs.1869 phosphoglucomutase 1 5.2 

101805 AW409747 Hs.75612 stress-induced-phosphoprotein 1 (Hsp7Q/Hsp90-organlzing protein) 8.6 

15 101806 AA586894 Hs.1 12408 S100 calcium-binding protein A7(psoriasin 1) 8.9 

101810 NM_0Q0318Hs. 180612 peroxisomal membrane protein 3 {35kD, Zellweger syndrome) 3.2 

101879 AA1 76374 Hs.243886 nuclear autoantigenic sperm protein (histone-binding) 1.6 

101911 AA441787 Hs.1 19689 glycoprotein hormones, alpha polypeptide 31.3 

101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA 11 1.8 

20 101973 U41514 Hs.80120 UDP-N-acety{-alpha-T>galactosamine:porypeptide N-acetylgaiactosaminyltransferase 1 (GalNAc-T1) 2.4 

101983 AI904232 Hs.75323 prohlbitin 8.4 

102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3 

102036 BE250127 Hs.82906 COC20 (ceD division cycle 20, S. cerevisiae, homolog) 2.0 

102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 45kD 1.6 

25 102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 1.3 

102107 BE258602 Hs.182366 heat shock protein 75 1.4 

102123 NM 0Q1809Hs,1594 centromere protein A (17kD) 1.8 

102165 BE313280 Hs.159627 death associated protein 3 4.6 

102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regulatory subunit (50kD) 4.3 

30 102217 AA829978 Hs.301613 JTVIgene 6.7 

102220 U24389 Hs.65436 lysosomal 43 

102234 AW163390 Hs.278554 heterachromaWke protein 1 1.9 

102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, importin alpha 1) 4.4 

102302 AA306342 Hs.69171 protein kinase C-Jke 2 Z7 

35 102330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta) 1.5 

102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3 

102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member B2 2.0 

102349 AU077055 Hs.289107 bacutoviral IAP repeat-containing 2 3.2 
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 2.0 

40 102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 

102391 AA296874 Hs.77494 deoxyguanosine kinase 1.5 

102455 U48705 Hs.75562 diswidm domain receptor family, member 1 6.9 

102465 NMJ)01359Hs.81548 2,4-dienoyl CoA reductase 1, mitochondrial 1.8 

102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1,59kD 1.5 

45 102489 AL080116 HsJ4420 origin recognition complex, subunit 3 (yeast homobg)-Iike 3.3 

102494 AI188137 Hs.75193 COP9 homolog 2.1 

102501 AF217197 Hs.74562 siah binding protein 1;FBP interacting repressor, pyrimidine tract binding splicing 3.2 

102522 BE250944 Hs.183556 solute carrier famfly 1 (neutral amino acid transporter), member 5 2.8 

102532 AF040253 Hs.70186 suppressor of Ty(S.cerevisiae) 5 homolog 5.7 

50 102564 U59423 Hs.79067 MAD (mothers against decapentaplegic, Drosophila) homolog 1 2.3 

102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3 

102580 U60808 Hs.152981 CDP-diacytgrycerol synthase (phosphaSdate cytidylyltransferase) 1 2.1 

102581 AU077228 Hs.77256 enhancer of zesle (Drosophila) homolog 2 1.6 

102582 U61232 Hs.32675 fubufa-spectficchaperonee 2.1 
55 102617 AW161453 Hs.198767 COP9 {(^stitutive photomoroh^ 1.8 

102618 AUJ37672 Hs.81071 extracellular matrix protein 1 5.8 

102627 AU321918 Hs.158174 zinc finger protein 184 (Krupp el-like) 1.3 

102663 NM_002270Hs.168075 karyopherin (importin) beta 2 1.8 

102676 BE262989 Hs.12045 putative protein 2.3 

60 102687 NM_007019Hs.93002 ubiquitin carrier protein E2-C 4.3 

102689 U96132 Hs.1 7 1280 hydroxyacyl-Coenzyme A dehydrogenase, type II 6.0 

102696 BE540274 Hs.239 torkhead box M1 4.2 

102704 AU07705B Hs.54089 BRCA1 associated RING domain 1 1.9 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cys-Cys), member 19 2.3 
65 102750 AB014460 Hs.66196 nth (E.coli endonudease lipke 1 1.2 

102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinase 6.4 

102812 U90549 Hs.236774 high-mobility group (nor^istone chromosomal) protein 17-like 3 1.6 
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102827 BE24458S Hs.6456 chaperonin containing TCPl.subunit 2 (beta) 5.6 

102831 AA262170 Hs.80917 adaptor-related protein complex 3. sigma 1 subunit 2,0 

102844 AV653790 Hs.324275 WW domain-containing protein 1 1.3 

102868 X02419 Hs.77274 plasminogen activator, urokinase 4,4 

5 102925 BE440142 Hs.2943 signal recognition particle 19kD 1.9 

102935 BE561850 Hs.80506 small nuclear ribonudeoproteln polypeptide A* 2.4 
102968 AU076611 Hs.154672 methylene tetrahydrofolate dehydrogenase (NAD* dependent), methenyltetrahydrofolatecyclohydrolase2.7 

102983 BE387202 Hs.118638 norwnetastatic cells 1, protein (NM23A) expressed In 3.1 

102985 U95742 Hs.2707 G1 to S phase transition 1 5,2 

1 0 103023 AW500470 Hs.1 17950 multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 1.6 

103038 AA926960 Hs.334883 COC28 protein kinase 1 2.5 

103060 NM_005940Hs.155324 matrix metalloproteinase 11 (MMP11; stromelysin 3) 4.5 

103080 AU077231 Hs.82932 cydin D1 (PRAD1: parathyroid adenomatosis 1) 3.1 

103089 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasia) 2.4 

15 103177 BE244377 Hs.48876 famesyl-diphosphate famesyttransferase 1 3.5 • 

103178 AA205475 Hs.275865 ribosomal protein S18 9.9 

103179 NMJ)01777Hs.82685 CD47 antigen (Rh-reiated antigen, Integrin-associated signal transducer) 1.3 
103181 X69636 Hs.334731 Homo sapiens, done IMAGE:3448306, mRNA, partial cds 2.0 
103185 NM_006825Hs.74368 transmembrane protein (63kD), endoplasmic reticulum/Golgi intermediate compartment 1 .6 

20 103191 AA401039 Hs.2903 protein phosphatase 4 (formerly X), catalytic subunit 2.5 

103193 NM_004766Hs.75724 coatomer protein complex, subunit beta 2 (beta prime) 2.2 

103194 NM_004939Hs.78580 DEAD/H (Asp-GIu-Ala-Asp/His) box polypeptide 1 6.3 
103206 X72755 Hs.77367 monokine Induced by gamma Interferon 8.8 
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (gamma) 3.0 

25 103232 X75962 Hs.129780 tumor necrosis factor receptor superfamiiy, member 4 1.8 

■ 103238 AI369285 Hs.75189 death-assodated protein 5.6 

103297 NMJ)01545Hs.9078 Immature colon cardnoma transcript 1 1,9 

103330 AI803447 Hs.77496 small nuclear ribonudeoprotein polypeptide G 2.5 

103349 X89059 gb:H.sapiens mRNA for unknown protein expressed in macrophage 1.6 

30 103376 AL036166 Hs.323378 coated vesicle membrane protein 1.8 

103391 X94453 Hs. 114366 pyrrofine-5-carboxytate synthetase (glutamate gamma-semialdehyde synthetase) 2.3 

103392 X94563 gb:Rsapiens dbi/acbp gene exon 1 & 2. .4.0 
103430 BE564090 Hs.20716 transtocase of inner mitochondrial membrane '17 (yeast) homologA 1.3 
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 5.6 

35 103505 AL031224 Hs.33102 transcription factor AP-2 beta (actrvatlng entencer-binding protein 2 beta) 5.1 

103547 AI376722 Hs.180Q62 proteasome (prosome, macropain) subunit beta type, 8 (large multifunctional protease 7) 9.7 
103588 NM_006218Hs.85701 phospholnositide^-Wnase, catalytic, alpha polypeptide . 2.0 

1 0361 3 NM_000346Hs.231 6 SRY (sex determining region Y)iox 9 (campomelic dysplasia, autosomal sex-reversal) 1 .3 

103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.QkD) 2.0 

40 103622 AA609685 Hs.278672 membrane component chromosome 11, surface marker 1 2.3 

103727 AI878883 Hs.296381 growth factor receptor-bound protein 2 1.3 

103749 AL135301 Hs.8768 hypothetical protein FU10849 1.8 

103754 AI015709 Hs.172089 Homo sapiens mRNA; cONA DKFZp586l2022 (from done DKFZp586l2022) 1.3 

103780 AA094752 Hs. 169992 hypothetical 43.2 Kd protein 7.5 

45 103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) mRNA, partial cds 1.2 

103797 AA080912 gb:zn04d03.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA done 5' similar 1.5 

103813 A1042582 Hs.181271 CGM 20 protein 1.5 

103855 W02363 Hs.302267 hypothetical protein RJ10330 1,5 

103886 AK001278 Hs.105737 hypothetical protein FU 1041 6 similar to constitutive photomorphogenic protein 1 6.5 

50 104052 NMJ)O2407Hs.'97644 mammaglobin2 2.9 

104079 AA251242 Hs.103238 ESTs 1.4 

104174 AA478984 Hs.6451 PRO0659 protein 5.6 

104227 AB002343 Hs.98938 protocadherin alpha 9 1.6 

104275 AI751970 Hs.101067 GCN5 (general control of amino-add synthesis, yeast, homolog)-like 2 5.4 

55 104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 6.3 

104370 AA324597 Hs.21851 Homo sapiens cONAFU12900fe, done NT2RP2004321 1.6 

104423 R83113 Hs.1432 protein kinase C substrate 80K-H 5.2 

104482 A8037762 Hs.44268 myelin gene expression factor 2 1.2 

104532 AI498763 Hs.203013 hypothetical protein FU1 2748 2.1 

60 104563 AL1 17403 Hs.306189 DKFZP434F1735 protein 1.2 

104667 AI239923 Hs.30098 ESTs 1.3 

104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily I, member 6 2.3 

104804 A1858702 Hs.31803 ESTs, Weakly similar to N-WASP [H^apiens] 1.3 

104806 AB023175 Hs.22982 KIAA0958 protein 2.3 

65 104827 AW052006 Hs.8551 PRP4/STK/WD spfidng factor 10.9 

104846 AI250789 Hs.32478 ESTs 5.6 

104854 AA041276 Hs.154729 3-phosphoinositide dependent protein klnase-1 12.3 
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104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especially RAP-2A 20 
104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp56402364 (from clone DKFZp56402364) 1.3 
104896 AW015318 Hs.23165 ESTs 17 7 

104909 AW408154 Hs.249184 transcription factor 19 (SC1) 5 6 

5 104916 AW958157 Hs.155489 NS1 -associated protein 1 \'j 
104919 AA026880 Hs.25252 prolactin receptor 14 
104930 AF043467 Hs.32893 neurexophilin 2 22 

104973 NMJ15310HS.6763 KIAA0942 protein 5 0 

104974 Y12059 Hs.278675 bromodomaifwonlaining 4 1.4 
10 104975 AL136877 Hs.50758 SMW (stnjrfuraJ rnaMer^^ 24 

104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromoso 2 3 

104978 AM99268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010317E24 gene, done IMAGE:3502019, mRNA, partial cds7.2 

104979 AA937934 Hs.321062 ESTs " 1 3 
104994 AI499930 Hs.334885 mitochondrial GTP binding protein 3^5 

15 105009 8E379584 Hs.34789 dolktyWiptosphoo^osaa*^^ 5.5 
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 33 
105028 AI050715 Hs.2331 E2F transcription factor 5, p130-binding 22 
105032 AA127818 gb:zll2a02.s1 Soares_pregnantuterus_NbHPU Homo sapiens cDMA done IMAGE:501674 3* 6.8 

105039 AA907305 Hs.36475 ESTs 2 5 

20 105041 AB037716 Hs.26204 KIAA1 295 protein 22 
105045 BE242899 Hs.129951 speckkMype POZ protein 3 8 

105079 AA151342 Hs,12677 CGM47 protein 9 5 

105087 AA147884 Hs.9812 Homo sapiens cONA FU 14388 fis, done HEMBA 100271 6 5.6 

105088 H58589 Hs.35156 Homo sapiens CONAFU11027 fis. done PLACE10041 14 22 
25 105095 Z78407 Hs.27023 veside uansporWelated protein 22 

105110 BE387350 Hs.33122 K1AA1 160 protein 16 

105126 AW975433 Hs.36288 ESTs 6 3 

105127 AA045648 Hs.301957 nudix (nudeoside diphosphate linked moiety X)-type motif 5 2A 
105141 AA164687 Hs.177576 mannosyl(alpha-1,3->glycoprotein beta-1,4-N-acetylglucosaminyltransferase, isoenzyme A 2.7 

30 105158 AW976357 Hs.234545 hypothetical protein NUF2R 19 
105169 BE245294 Hs.180789 S164 protein i V 

105186 AA191512 Hs.28005 Homo sapiens cDNA FU1 1309 fis, clone PLACE 1010076 4*8 

105254 AA071276 Hs.19469 KIAA0859 protein 19 

105281 AA263143 Hs.24596 RAD51-toteracfing protein i% 

35 105288 N99673 Hs.3585 ESTs, Weakly similar to AF1 26743 1 DNAJ domain-containing proteinMCJ [H.sapiens] 1.9 

105302 AA700122 Hs.3355 sentrin-spedfic protease 8 0 

105331 AW270037 Hs.179507 KIAA0779 protein 18 

105359 NMJ)16015Hs.8054 CGI-68 protein 8 2 

105366 BE264645 Hs.282093 hypothetical protein FU21 91 8 5.0 

40 105373 AW887701 Hs.32356 hypothetical protein FU2G628 2 5 

105374 BE242803 Hs.262823 hypothetical protein FU 10326 2.2 

105387 AW592146 Hs.108636 membrane protein CH1 2.3 

105393 AF1 67570 Hs.256583 interieukin enhancer binding factor 3, 90kD 54 

105399 BE386877 Hs.334811 Npw38-binding protein NpwBP 1.6 

45 105400 AF198620 Hs.65648 RNA binding motif protein 8A 1.6 

105445 AA252395 gb:zs12g10.s1 NCLCGAPJ3C81 Homo sapiens cDNA done IMAGE:685026 3', mRNA sequence. 5.0 

105507 BE268348 Hs.226318 CCR4-NOT transcription complex, subunit 7 1.6 

105529 AA1 13449 Hs.32471 hypothetical protein FU20364 1.3 

105530 AB023179 Hs.9059 KIAA0962 protein 3 4 
50 105547 AA262640 Hs.27445 unknown 93 

105564 BE616694 Hs.288042 hypothetical protein FU 14299 1 4 

105596 AA579535 Hs.18490 hypothetical protein FU20452 10.9 

105597 AF054284 Hs.334826 splicing factor 3b, subunit 1. 155kD 2.9 
105608 AI808201 Hs.287863 hypothetical protein FU 12475 1 7 

55 105610 AA280072 Hs.99872 fetal Alzheimer antigen 1.4 

105617 AK000892 Hs.4069 glucocorticoid modulatory element binding protein 1 17 

105620 AW302245 Hs.181390 casein kinase 1, gamma 2 5 5 

105658 AA985190 Hs.246875 hypothetical protein FU20059 9 4 

105697 AW499988 Hs.27801 zinc finger protein 278 2 0 

60 105708 R26944 Hs.180777 Homo sapiens mRNA; cDNA OKFZp564M0264 (from done DKFZp564M0264) 1.7 

105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short 2.6 

105746 AW151952 Hs.46679 hypothetical protein FU20739 * 15 

105759 AI123118 Hs.15159 chemokine^ike factor, alternatively spliced 1.3 

105771 AI267720 Hs.153221 synovial sarcoma, translocated to X chromosome 16 

65 105820 AA741336 Hs.152108 transcriptional unit N 143 22 

105826 AA478756 Hs.194477 E3 ubiquitm ligase SMURF2 1.3 

105856 A1262108 Hs.12653 ESTs 2.4 
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105858 AF151066 Hs.281428 hypothetical protein 2.9 

105875 AK001708 Hs.32271 hypothetical protein FU 10846 1.4 

105930 AF016371 Hs.9880 peptidyl prolyl isomerase H (cyclophilin H) 5.2 

106000 AW194426 Hs.20726 ESTs 1.7 

5 106011 AW081202 Hs.12284 Homo sapiens, done IMAGE:2989556, mRNA, partial cds 2.8 

106017 AA477956 Hs.26268 ESTs 1.4 

106073 AL157441 Hs.17834 downstream neighbor of SON 1.4 

106078 AA130158 Hs.19977 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1 .6 

106094 AA533491 Hs.23317 hypothetical protein FU14681 6.8 

10 106140 AB006624 Hs.14912 WAA0286 protein 1.6 

106271 AA251393 Hs. 289052 Homo sapiens, Similar to RIKEN cDNA 5430429M05 gene, clone MGC: 131 55, mRNA, complete cds 10.8 

106288 AB037742 Hs.24336 KIAA1321 protein 1.3 

106300 Y10043 Hs.19114 high-mobility group (nonhistone chromosomal) protein 4 3.6 

106333 AL043114 Hs.2241Q ESTs, Weakly similar to A54B49 collagen alpha 1 (VI I) chain pre cursor (H.sapiensI 5.4 

15 106350 AK001404 Hs.194698 cycfinB2 5.7 

106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2 6.3 

106381 AB040916 Hs.241Q6 WAA1483 protein 6,5 

106389 AW748420 Hs.6236 Homo sapiens cDNA: FU21487 fis, done COL05419 2.2 

106457 AF1 19256 Hs.27601 zinc finger protein 278 2.7 

20 106470 D63078 Hs. 186 160 Homo sapiens cDNA; FU23038 fis, clone LNG02039 2.3 

106531 AA454036 Hs.8832 ESTs 1.6 

106586 AA243837 Hs.57787 ESTs 1.6 

106589 AK000933 Hs.28661 Homo sapiens cDNAFU 10071 fis, clone HEMBA1001 702 2.4 

106610 AA458882 Hs.79732 fibulln 1 7.9 

25 106624 NM.003595HS.26350 tyrosylprotein sulfotransferase 2 7.7 

106650 AL049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from done DKFZp564O01 22) 1.8 

106669 AV6571 17 Hs.1 84164 ESTs, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [Rsapiens] 1.3 

106713 BE614802 Hs.184352 hypothetical protein FU 12549 4.5 

106717 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 1.3 

30 106723 BE388094 Hs.21857 ESTs 1.6 

106795 AF174487 Hs.293753 Bd-2-related ovarian killer protein-like 5.7 

106829 AW959893 Hs.27099 hypothetical protein FU23293 similar to ARL-6 interacting protein-2 ■ 16.2 

106831 BE564871 Hs.29463 centrin, EF-hand protein, 3 (CDC31 yeast homolog) 1.5 

106846 AB037744 Hs.34892 KIAA1323 protein 2.2 

35 106852 AF151031 Hs.300631 hypothetical protein 1.3 

106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, partial cds 16.8 

106886 W79171 Hs.9567 GL002 protein 1.5 

106906 AA661271 Hs.222024 transcription factor BMAL2 2.2 

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase 3.3 

40 106945 AK000511 Hs.6294 hypothetical protein DKFZp434L1 435 similar to valyltRNA synthetase 6.8 

106973 BE156256 Hs.1 1923 hypothefcal protein 6.6 

106977 AL043152 Hs.50421 KIAA0203 gene product 4.8 

106978 AW631480 Hs.8688 ESTs 6.0 
107004 AA146872 Hs.300700 hypothetical protein FU20727 1.3 

45 107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 1.8 

107071 AW385224 Hs.35198 ectonudeotide pyrophosphatase/phosphodiesterase 5 (putative function) 1.7 

107113 AK000733 Hs.23900 GTPase activating protein 2.5 

107125 AK000512 Hs.69388 hypothetical protein FU20505 1.7 

107136 AV661958 Hs.8207 GK001 protein 4.6 

50 107136 AV661958 Hs.8207 GK001 protein 3.3 

107146 AK001455 Hs.5198 Down syndrome critical region gene 2 2.0 

107151 AW378065 Hs.8687 ESTs 6.3 

107155 AW391927 Hs.7946 KIAA1 288 protein 33.5 

107174 BE122762 Hs.25338 ESTs 5.2 

55 107197 W15477 Hs.64639 glioma pathogenesis-related protein 6.1 

107221 AW888411 Hs.81915 Ieukemia-assodatedphosphoprotelnp18(stathmin) 17.4 

107243 BE219716 Hs.34727 ESTs. Moderately similar to I38759 zinc finger/Ieucine zipper protein [H.sapiens] 7.4 

107248 AW263124 Hs.3151 11 nudear receptor co-repressor/HOAC3 complex subunit 1 .8 

107263 D60341 Hs.21198 translocase of outer mitochondrial membrane 70 (yeast) homolog A 6.6 

60 107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.5 

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1.CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 2.5 

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1JCAEEL H 1.7 

107299 BE277457 Hs.30661 hypothetical protein MGC4606 3.2 
107316 T63174 Hs.193700 Homo sapiens mRNA; cDNA DKFZp586l0324 (from done DKFZp586I0324) 2.0 

65 107354 NM 006299 Hs.96448 zinc finger protein 193 5.0 

107392 AW299900 Hs.267632 TATA element modulatory fador 1 1.2 

107481 AA307703 Hs.279766 kinesln family member 4A 1.6 
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107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) 3.0 

107554 AAQ01386 Hs.59844 ESTs 1.3 
107681 8E379594 Hs.49136 ESTs, Moderately similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.2 

107772 AA018587 Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1 

5 107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed-rectifier, subfamily S, member 3 8.4 

107901 L42612 Hs.335952 keratin 6B " 2.5 

107901 L42612 Hs.335952 keratin 6B 1.6 

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 22 

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 1 6J 

10 108040 AL121031 Hs.159971 SW1/SNF related, matrix associated, actln dependent regulator of chromatin, subfamily b, member 1 1.5 

108230 AA054224 Hs.59847 ESTs 1.3 

108274 AF129535 Hs.272027 F-box only protein 5 7.1 

108298 N31256 Hs.161623 ESTs 2.5 

108496 AA083069 Hs.339659 ESTs 3.5 

15 108607 BE300380 Hs.69476 Homo sapiens cONAFUl 2758 fis, clone NT2RP20O1 3 28 3.4 

108621 AA101809 Hs.182685 ESTs 1.6 

108634 AW022410 Hs.69507 ESTs 1.7 

108647 BE546947 Hs.44276 homeoboxCtO 9.8 

108695 AB029000 Hs.70823 KIAA1 077 protein 72 

20 108717 AA122393 Hs.70811 hypothetical protein FU205 16 1.3 

108740 A1089575 Hs.9071 progesterone membrane binding protein 2.7 

108828 AK001693 Hs.273344 OKFZP564O0463 protein 1.8 

108859 AL121500 Hs.178904 ESTs 1.5 

108872 H06720 Hs.111680 endosulRne alpha 2.1 

25 108891 A1801235 Hs.48480 ESTs 5.3 

108894 AK001431 Hs.5105 hypothetical protein FU 10569 4.0 

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds 5.6 

108982 AA151708 Hs.171980 homeo box {expressed in ES cells) 1 1.6 

108987 AA152178 Hs.23467 hypothefical protein FU 1063 3 6.2 

30 109002 AB028987 Hs.72134 KIAA1 064 protein 17 

109011 AA156542 Hs.72127 ESTs 14 
109026 AA157811 gb:zo35d07.s1 Stratagene colon (937204) Homo sapiens cONA clone 3' slmDar to contains A!u repetitive 5.3 

109068 AA164293 Hs.72545 ESTs 2.9 

109101 AW608930 Hs.52184 hypothetical protein FLI2061 8 1.6 

35 109112 AW419196 Hs.257924 hypothetical protein FU1 3782 3.2 

109124 AK000684 Hs.183887 hypothetical protein FLJ22 104 1 7 

109139 AJ132592 Hs.59757 zinc finger protein 281 2.6 

109166 AA219691 Hs73625 RAB6 Interacting. kinesin-Uke (rabkinesin 6) 2.9 

109198 BE566742 Hs.58169 highly expressed In cancer, rich in leucine heptad repeats 2.0 

40 109213 NM_016603Hs.82035 potential nuclear protein C50RF5; GAP-like protein 5.3 

109220 AW958181 Hs.189998 ESTs 5.7 

109233 AU077281 Hs.170285 nudeoporin 214kD (CAIN) 5.3 

109270 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ [H.sapiens] 1.4 

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp566F1822 (from clone DKFZp586F1 822) 2.9 

45 109313 AF153201 Hs.86276 C2H2 (KruppeUype) zinc finger protein 1.3 

109341 AA213506 Hs.115099 EST 2.9 

109391 AL096858 Hs.184245 K1AA0929 protein Msx2 interacting nuclear target (MINT) homobg 1.5 

109420 H83603 Hs.40408 homeo box C9 2.2 

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subunit 6 3.0 

50 109429 A1160029 Hs.61438 ESTs 1.9 

109445 AA232103 Hs.189915 ESTs 1.8 

109450 AB032969 Hs.173042 WAA1143 protein 3.7 

109468 NM 01531 QHs.6763 KIAA0942 protein 3.2 

109478 AW074143 Hs.87134 ESTs 2.0 

55 109570 L40027 Hs. 11 8890 glycogen synthase kinase 3 alpha 2.1 

109662 F02614 Hs.27319 ESTs 1.4 

109825 R71264 Hs.16798 ESTs 1.3 

110039 H11938 Hs.21907 histone acetyltransferase 2.0 

110056 AA503041 Hs.279009 matrix Gla protein 2.5 

60 110085 AA603840 Hs.29956 KIAA0460 protein 1.7 

110110 T07353 Hs.7948 ESTs 2.9 

110129 R51853 Hs.226429 ESTs. Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 17 

110154 NNL014521KS.17667 SH3-domain binding protein 4 4.2 
1 10240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOCHROME P450 4A1 1 PRECURSOR {H^apiens] 4.2 

65 110242 N41744 Hs.19978 CGW0 protein 1.3 

110259 H28428 Hs.32406 ESTs, Weakly similar to I38022 hypothetical protein [H^apiens] 2.2 

110312 BE256986 Hs.11896 hypothetical protein FU12089 Z1 
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110330 AI288666 Hs.16621 DKF2P434I1 16 protein 6.2 
110501 H55748 gb:yq94a01.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA done IMAGE:203400 3' 6.1 

110504 H55915 Hs.210859 hypothetical protein FLJ1 1016 6.1 

110525 H57330 Hs.37430 EST 63 

5 110568 AK001160 Hs.5999 hypothetical protein FU 10298 13 

110699 T97586 Hs.18090 ESTs 18 

110705 AB007902 Hs.32168 KIAA0442 protein 1.6 

110742 AW190338 Hs.28029 hypothetical protein MGC1 1256 7.6 

110761 AL138077 Hs.16157 hypothetical protein FLJ 12707 2.5 

10 110762 BE044245 Hs.30011 hypothetical protein MGC2963 '9.3 

110765 AK000322 Hs.1B457 hypothetical protein FU2031 5 5.5 

110769 BE000831 Hs.23837 Homo sapiens cDNA FU 11 81 2 lis, clone HEMBA 1006364 2.1 

110799 AI089660 Hs.323401 dpy-3Wike protein 1.5 

110805 T25829 Hs.24048 FK506 binding protein precursor 6.6 

15 110813 AA767373 Hs.35669 ESTs, Moderately similar to ALU1JWMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 5.7 

1 10820 R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens] 3.4 

110840 N31598 Hs.12727 hypothetical protein FU21 610 1.7 

110844 AI740792 Hs.167531 methyicrotonoyl-Coenzyme A carboxylase 2 (beta) 17 

110854 BE612992 Hs.27931 hypofoetica! protein FU10607. similar to glucosamme-phosphateN-acetyltransferase 4.7 

20 110856 AA992380 gb:ot37g06.s1 Soares>stis_NHTHonwsapIeracONA clone 3' similar to contains element 2.3 

110885 BE384447 Hs.16034 hypothetical protein MGC1 31 86 3.5 

110897 AL1 17430 Hs.6880 DKFZP434D1 56 protein 2.2 

110915 BE092285 Hs.29724 hypothetical protein FU 13187 2.6 1 

110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expression in cancer [H.sapiens] 1.9 

25 110958 NM_005864Hs.24587 signal transduction protein (SH3 containing) 6.7 

110963 AK002180 Hs.11449 DKFZP56401 23 protein 2.0 

110981 AK001980 Hs.24284 ADP-ribosyitransferase (NAD+; poiy(ADP-ribose) po!ymerase)-like 2 1.3 
110984 AW613287 Hs.80120 UDP-N-acetyl-alpha-D-ga!actosamine:polypeptide N-acetylgalactosaminyltransferase 1 (Ga!NAc-T1) 1.8 

111125 N63823 Hs.2691 15 ESTs, Moderately similar to Z195JHUMAN ZINC FINGER PROTEIN 195 [H^apiens] 3.6 

30 111132 AB037807 Hs.83293 hypothetical protein 2.1 

111164 N46180 Hs.122489 Homo sapiens cONAFU 13289 fis, clone OVARC1001170 2.3 

111172 R67419 Hs.21851 Homo sapiens cDNA FU12900 fis, done NT2RP2004321 3.7 

111174 AL050166 Hs.26295 Homo sapiens mRNA; cDNA DKFZp586D1 122 (from done DKFZp586D1 122) 7.5 

111179 AK000136 Hs.10760 asporin (LRR dass 1) 7.1 

35 111184 AI815486 Hs.243901 Homo sapiens cDNA FU20738 fis, clone HEP08257 6.7 

111184 AI815486 Hs.243901 Homo sapiens cDNA FU20738 fis, clone HE 3.3 

111 189 N67603 Hs.272130 ESTs, Weakly similarto S65824 reverse transcriptase homolog [H.sapiens] 3.6 

111216 AW139408 Hs.152940 ESTs 1.5 

111221 AB037782 Hs.15119 KIAA1 361 protein 2.6 

40 111223 AA852773 Hs.334838 KIAA1866 protein 4.6 

111239 N90956 Hs.17230 hypothetical protein FU22087 7.9 

111285 AA778711 Hs.4310 eukaryotic translation initiation factor 1 A 6.9 

111299 AB033091 Hs.74313 KIAA1 265 protein 5.0 

111312 AI523913 Hs.34504 ESTs 3.8 

45 111318 T99755 Hs.334728 ESTs 1,2 

111337 AA837396 Hs.263925 LIS1 -interacting protein NUDE1. rat homolog 5.1 

111352 H58589 Hs.35156 Homo sapiens cDNA FU1 1027 fis, done PLACE10041 14 2.2 

111370 AI478658 Hs.94631 brefeldin A4nhfoited guanine nudeotide-exchange protein 1 2.8 

111384 N94606 Hs.288969 HSCARG protein * 2.2 

50 111389 AK000987 Hs.169111 oxidation resistance 1 2.1 

1 1 1391 NMJX)3896Hs.225939 siatyitransferase 9 (CMP-NeuAciactosylceramide aIpha-2,3-siatyitransferase; GM3 synthase) 5.1 

111392 W46342 Hs.325081 Homo sapiens, done IMAGE :3659680, mRNA, partial cds 8.4 
111452 R02354 Hs.15999 ESTs 27 
111486 AI051194 Hs.227978 EST 6 5 

55 111549 W90638 Hs.20321 ESTs, Moderately similar to ZRF INHUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE 1.4 

111585 R10720 Hs.20670 EST 1.6 

111627 R52656 Hs.21691 ESTs 1.6 

111870 AB037834 Hs.18685 Homo sapiens mRNA for KIAA1413 protein, partial cds 2.4 

111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (from done DKFZp564D016) 10.6 

60 111944 AW083791 Hs.21263 suppressor of potassium transport defect 3 6.6 

111987 NM_015310Hs.6763 KIAA0942 protein 5.1 

112134 R41823 Hs.7413 ESTs; calsyntenin-2 28 

112244 AB029000 Hs.70823 KIAA1077 protein 14.6 

112388 R46071 Hs.301693 Homo sapiens, done IMAGE3638994, mRNA, partial cds 90 

65 112456 NMJ)16248Hs.232076 A kinase (PRKA) anchor protein 11 1.4 

112464 AW007287 Hs.28538 Homo sapiens cDNA: FU21086 fis, done CAS03272 14 

112506 AI742756 Hs.26079 ESTs 3.2 
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112513 R68425 Hs.13809 hypofoetica) protein FU10648 2.0 

112752 AK001635 Hs.14838 hypothetical protein FU 10773 1.8 

112884 AK000004 Hs.5013 Homo sapiens mRNA for FLJ00004 protein, partial cds 6.6 

112923 T10258 Hs.5037 EST 1.5 

5 112936 AW970826 Hs.6185 K1AA1557 protein 3.2 

112958 R61388 Hs.6724 ESTs 6.0 

112966 Z44718 Hs.102548 glucocorticoid receptor DN A binding factor 1 6.4 

112978 AK000272 Hs.7099 hypothetical protein R J 20265 1.2 

112995 AA737033 Hs.7155 ESTs, Moderately similar to 2115357A TYKl protein [M.muscutus] 5.6 

10 112996 BE276112 Hs.7165 zinc finger protein 259 2.0 

113047 A1571940 Hs.7549 ESTs 1.9 

113049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein, partial cds 2.4 

113089 T40707 Hs.270862 ESTs 1.3 

1 13196 T57317 gb:yb51a03.s1 Stratagene fetal spleen (937205) Homo sapiens cDNA clone IMAGE:74668 3', 1.7 

15 113248 T63857 gb:yc16e01.s1 Stratagene lung (9372 10) Homo sapiens cDNA done 3', mRNA sequence 2.8 

113254 AK002180 Hs.11449 DKFZP564 01 23 protein 1.3 

1 13277 AW971049 Hs. 1 1774 protein (peptidyl-protyi cls/trans isomerase) NlMA-interacting, 4 (parvulin) 3.2 

113429 AA688021 Hs.179808 ESTs \2 

113499 AI467908 Hs.8882 ESTs 5.9 

20 113547 H59588 Hs.15233 ESTs 2.0 

113554 AW503990 Hs,142442 HP1-BP74 3.6 

113647 AA813887 Hs.188173 Homo sapiens cDNAFUl 2187 lis, done MAMMA1000831 1.3 

1 13702 T97307 gb:ye53h05.s1 Scares fetal Over spleen 1NFLS Homo sapiens cDNA clone IMAGE;121497 3 1 , 4.4 

113722 AV653556 Hs.184411 albumin 1.3 

25 1 13759 AW499665 Hs.9456 SW1/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 1 .2 

113777 BE266947 Hs.10590 zinc finger protein 313 13.4 

113783 AL359588 Hs.7041 hypothetical protein DKFZp762B226 1.7 

113791 AI269096 Hs. 1 35578 diitobiase, oWacetyl- 1.3 

113808 W44735 Hs.9286 Homo sapiens cDNA: FU21278 lis. done Ca01832 3.3 

30 113811 BE207480 Hs.6994 Homo sapiens cDNA: FU22044 (is, clone HEP09141 3.1 

113817 H13325 Hs.332795 hypothetical protein DKF2p76 101 71 21 3.2 

113826 AW378212 Hs.24809 hypothetical protein FU 10826 2.3 

113834 T26483 Hs.6059 EGF-containing fibulin-like extracellular matrix protein 2 11.3 

113868 W57902 Hs.90744 proteasome (prosome, maaopain) 26S subunit, non-ATPase, 11 2.7 

35 113870 AL079314 Hs.16537 hypothetical protein, similar to (U06944) PRAJA1 6.1 

113885 AW959486 Hs.21732 ESTs 6.6 

1 13923 AW953484 Hs.3849 hypothetical protein FU22041 similar to FK506 binding proteins 1.9 

113989 W87544 Hs.268828 ESTs \2 

114022 A1539519 Hs.120969 Homo sapiens cDNAFU11562fe, done HEMBA10031 97 5.4 

40 114030 AI825386 Hs. 164478 hypothetical protein FU21 939 similar to tazacytidine Induced gene 2 9.4 

114060 AB029551 Hs.7910 RING1 and YY1 binding protein 1.8 

114196 AF017445 Hs.150926 fucose-1 -phosphate guanyiyltransferase 1.5 

114226 AB028968 Hs.7989 K1AA1045 protein 1.8 

114253 BE149866 Hs,14831 Homo sapiens, Similar to zinc finger protein 136 (done pHZ-20), done MGC:10647,irtfiNA, complete cds 2.3 

45 114262 AL1 17518 Hs.3686 KIAA0978 protein 1.4 

114275 AW515443 Hs.306117 KIAA0306 protein 15.8 

114292 AI815395 Hs.184641 fatty acid desaturase 2 1.9 

114309 AA332453 Hs.20824 CGI-85 protein 2.4 

114392 AA249590 Hs.100748 ESTs, Weakly similar to A28996 proline-rlch protein M14 precursor - mouse [Mmusculus] 1.8 

50 114407 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (from clone DKFZp434B0425) \2 

114455 H37908 Hs.271616 ESTs, Weakly similar to ALU8.HUMAN ALU SUBFAMILY SX SEQUENCE 5.5 

114463 AL120247 Hs.40109 KIAA0872 protein 5.2 

114464 AI091713 Hs.106597 Homo sapiens, Similar to RIKEN cDNA 1110012M11 gene, done IMAGE3688605, mRNA, partial cds1.2 
114471 AAO28074 Hs.104613 RP42homo!og 1.8 

55 114480 BE066778 Hs.151678 UDP4kK»ty^lpha0^aladDsarairw: 13.4 

114671 AA766268 Hs.266273 hypothetical protein FU 13346 1.9 

114698 AA476966 Hs.110857 potymerase (RNA) 111 (DNA directed) polypeptide K (12.3 kDa) 3.5 

114730 AI373544 Hs.331328 intermediate filament protein syncoiiin 3.8 

114767 AI859865 Hs.154443 minfohromosome maintenance defident (S. cerevisiae) 4 1.6 

60 114774 AV656017 Hs.184325 CGI-76 protein 3.1 

114798 AA159181 Hs.54900 serologically defined colon cancer antigen 1 3.5 

114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.3 

114895 AA236177 Hs.76591 K1AA0887 protein 7.1 

114896 BE539101 Hs.5324 hypothetical protein 1.3 

65 114911 AA236672 gb:zt29f02s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:723771 3', mRNA sequence. 1.5 

114930 AA237022 Hs.188717 ESTs 2.0 

114938 AA242834 Hs.58384 ESTs 2.9 
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114965 AI733881 Hs72472 BMP-R1B 2.3 

115023 AF102546 Hs.63931 dachshund (Orosophlla) homolog 1.3 

115038 AA252360 Hs.87968 toll-fike receptor 9 1.6 

115061 A1751438 Hs.41271 Homo sapiens mRNAIul I length insert cDNA done EUR01MAGE 1913076 118 

5 115062 AA253314 Hs.154103 UM protein (similar to rat protein Wna 1.5 

115117 AI670847 Hs.5324 hypothetical protein 1.5 

115121 AI634549 Hs.88155 ESTs 18 

115206 AW183695 Hs.186572 ESTs 2.5 

115221 AW365434 Hs.79741 hypothetical protein FU 101 16 1.5 

10 115239 BE251328 Hs.73291 hypofretical protein FU 10881 1.3 

115242 AI368236 Hs.283732 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE 1.4 

115278 AK002163 Hs.301724 hypothetical protein FU 11301 1.5 

115285 AW972872 Hs.293736 ESTs 2.4 

115291 BE545072 Hs.122579 hypothetical prolan FU 10461 6.2 

15 115400 AI215069 Hs.89113 ESTs 6.6 

115468 AA314349 Hs.48499 tumor antigen SLP4p 7.4 

115471 AK001376 Hs,59346 hypothetical protein FU 10514 1.4 

115479 AW301608 Hs.278188 ESTs, Moderately similar to 154374 gene NF2 protein [Ksapiens] 4,0 

115496 AW247593 Hs.71819 eukaryotic translation initiation factor 4E binding protein 1 16.3 

20 115500 Y14443 Hs.88219 zinc finger protein 200 5.0 

115553 AJ275986 Hs.71414 transcription fector(SMlF gene) 2.5 

115581 AI540842 Hs.61082 ESTs 6.1 

115587 BE081342 Hs.283037 HSPC039 protein 2.9 

115590 AA399477 Hs.67896 7-60 protein 5.3 

25 115646 N36110 Hs.305971 solute carrier family 2 {fadiirated gtose transpbrter), member 10 4.7 

115652 BE093589 Hs.38178 hypotheUcal protein FU23468 10.6 

115655 AL048269 Hs.288544 Homo sapiens, done MGC:16063, mRNA, complete ods 117 

115663 AI138785 Hs.40507 EST* 2.0 

115676 AA953006 Hs.88143 ESTs 3.0 

30 115690 AA625132 Hs.44159 hypothetical protein FU21615 1.7 

115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass 6-type receptor 3, flamingo (Drosophila) homolog 6.8 

115715 BE395161 Hs.1390 proteasome (prosome, macropaln) subunit beta type, 2 1.7 

115734 A1950339 Hs.40782 ESTs 2.6 

115811 NM_015434Hs.48604 DKFZP434B1 68 protein 2.1 

35 115823 AI732742 Hs.87440 ESTs 2.1 

115837 AI675217 Hs.42761 ESTs 1.3 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 4.4 

115866 AW062629 Hs.52081 KIAA0867 protein 7.2 

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13 1.2 

40 115941 AI867451 Hs.46679 hypothetical protein FU20739 5.5 

115968 AB037753 Hs.62767 KIAA1 332 protein 9.8 

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 1.4 

116011 ,AL359053 Hs.57664 Homo sapiens mRNA full length Insert cDNA clone EUROiMAGE 2005735 2.4 

116108 AA770688 Hs.28777 H2A histone family, member L 1.8 

45 116134 BE243834 Hs.50441 CGI-04 protein 1.4 

116189 N35719 Hs.44749 ESTs, Moderately similar to T0G358 hypothetical protein KIAA0684 [Usapiens] 1.2 

116195 AW821113 Hs.72402 ESTs 2.1 

116236 AV660717 Hs.47144 DKFZP586N081 9 protein 1.7 

116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 1.7 

50 116262 AI936442 Hs.59838 hypothetical protein FU 10808 1.7 

116298 AI955411 Hs.94109 Homo sapiens cONAFU 13634 fis, clone PUCE 101 11 33 1.9 

116318 AF097645 Hs.58570 deleted in cancer 1; RNA helicase HDB/DICE1 4.9 

116325 AI472106 Hs.49303 Homo sapiens cDNA FLJ1 1663 lis, clone HEMBA1004631 1.4 

116336 AL133033 Hs.4084 KIAA1 025 protein 1.9 

55 116339 AK000290 Hs.44033 dipeptidyl peptidase 8 1.5 

116350 AA497129 Hs.184771 nudear factor l/C (CCAAT-binding transcription factor) 1.9 

116358 AM49586 Hs.38125 in terferon-induced protein 75, 52kD 1.9 

116365 N50174 Hs.46765 ESTs 6.1 

116368 N90466 Hs.71109 K1AA1 229 protein 1.6 

60 116417 AW499664 Hs.12484 Human done 23826 mRNA sequence 7.4 

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57 2.1 

116462 AF218313 Hs.236828 putative helicase RUVBL 1.5 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 2.1 

116470 A1272141 Hs.83484 SRY (sex determining region Y)-box 4 1.2 

65 116575 AA312572 Hs.6241 phosphoinositide-3-kinase, regulatory subunit, polypeptide 1 (p85 alpha) 1.5 

116637 AK001043 Hs.92033 integrin-linked Wnase-assodated serine/threonine phosphatase 2C 2.7 

116640 X89984 Hs.211563 B-ce« CLL/lymphoma 7A 2.3 



122 



WO 02/059377 PCT/US02/02242 



116700 A1800202 Hs.317589 hypothetical protein MGC10765 J-4 

116705 AWQ74819 Hs.12313 hypothefcal protein FU 14556 j4 

116732 AW152225 Hs.1659Q9 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiensJ 2.9 

116921 AW068115 Hs.821 biglycan « 

5 116926 H73608 Hs.290830 ESTs \ 7 

117034 U72209 Hs. 1 80324 YY1 associated factor 2 3.4 

117132 AI393666 Hs.42315 p10-binding protein „ nM „ M1 , 5,2 cc 
1 17247 N21032 gb:yx46f06.s1 Scares melanocyte 2NbHM Homo sapiens cDNA done IMAGE264803 3 , mRNA sequence. 5.5 

117276 N71183 Hs.121806 Homo sapiens cDNAFUl 1971 fe. clone HEMBB1001208 1 5 

1 0 1 17284 AK001701 Hs.183779 Homo sapiens cONA RJ10590 fis, ctone NT2RP2004392, weakly similar to MNN4 PROTEIN 2.0 

117367 AI041793 Hs.42502 ESTs 2.0 

117368 AI878942 Hs.90336 ATPase, H+ transporting, lysosomal (vacuolar proton pump], member J £1 
117382 AF150275 Hs.40173 ESTs 21 
117412 N32536 Hs.42645 solute carrier family 16 {monocarboxylic acid transporters), member 6 1.4 

15 117557 AF123050 Hs.44532 dlubiquifin *J 

117588 N34895 Hs.44648 ESTs J.4 

117745 BE294925 Hs.46580 CGM2 protein *" 

117m AA121673 Hs.59757 anc finger protein 281 \* 

117879 N54706 Hs.303025 chromosome 11 open reading frame 24 J* 

20 117881 AF161470 Hs.260622 butyrate-induced transcript 1 ( 5.7 

117904 BE540675 Hs.332938 hypothetical protein MGC5370 *9 

117911 AL137379 Hs.47125 hypothetical protein FU13912 ™ 

117933 Y10518 Hs.116470 hypothetical protein RJ20048 ™ 

117983 AL1 10246 Hs.47367 KIM1785 protein *4 

25 118078 N54321 Hs.47790 EST ** 

118301 AA453902 Hs.293264 ESTs ** 
118429 AA243332 Hs.74649 cytochrome c oxidase subunit Vic 

118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 J-J 

118488 AJ277275 Hs.50102 rapa-2 (rapa gene) ]■? 

30 118509 N22617 Hs.43228 Homo sapiens cDNA FU11835 fis, clone HEMBA1 006595 1-5 

118528 A1949952 Hs.49397 ESTs 7A 

118656 AI458020 Hs.293287 ESTs nj , , " 

118670 AA332845 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN ZINC FINGER PROTEIN 91 (H,saptensl 1.2 

118698 AB033113 Hs.50187 K1AA1287 protein IMArc c ... flfl " 

3 5 1 18737 AA199686 gb:zq75g09.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA done IMAGE:647488 5 5.^ 

1 18925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4 
118984 AI668709 Hs.240722 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3.6 

118986 AF148713 Hs.1 25830 bladder cancer overexpressed protein J* 

119206 W24781 Hs.293798 K1AA1710 protein l J 

40 119235 AW453069 Hs.3657 activity-dependent neuroprotective protein " 

119235 AW453069 Hs.3657 acfivity-dependent neuroprotective prote % J* 

119265 BE539706 Hs.285363 ESTs J' 4 

119279 N57568 Hs.48028 EST f' 1 

119298 NMJ301241HS.155478 cycfinT2 J* 

45 119338 AM17240 Hs.320836 EST s, Weakly similar to A47582 B-cell growth factor precursor (H.saptens] W 

119349 T65004 Hs.163561 ESTs J-J 

119403 AL1 17554 Hs.1 19908 nucleolar protein NOP57NOP58 J J 

119478 AI624342 Hs.170042 ESTs l A 

119486 AI796730 Hs.55513 ESTs J- J 

50 119513 W37933 Empirically selected from AFFX single probeset J-f 

119601 AK000155 Hs.91684 Homo sapiens mRNA; cONADKFZp667l103 (from done DKFZp667l103) 3.7 

119602 AW675298 Hs.233694 hypothetical protein FU1 1350 3.0 
119676 AA243837 Hs.57787 ESTs \ A 
119682 W61019 Hs.57811 ESTs J-J 

55 119774 AB032977 Hs.6298 KIAA1 151 protein J» 

119780 NM.016625HS.191381 hypothetical protein *J 
119789 BE393948 Hs.50915 kaBlkrein 5 (KLK5; KLK-L2; stratum comeum trypfic enzyme) _ n _ imrcppn 
119805 AJ223810 Hs.43213 ESTs, Weakly similar to lEFSJiUMAN TRANSFORMATION-SENSITIVE PROTEIN IEFSSP 3.6 
119818 AA130970 Hs.58382 hypofoefica) protein FL)1 1101 

60 119863 AA081218 Hs.58608 Homo sapiens cDNA FU 14206 fis, done NT2RP30031 57 2.7 

119905 AW449064 Hs.119571 collagen, type Hi. alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant) 2.6 

119966 AA703129 Hs.58963 ESTs « 

120132 W57554 Hs. 12501 9 lymphoid nuclear protein (LAF4)mRNA y 

120206 H26735 Hs.91668 Homo sapiens done PP1498 unknown mRNA 45.7 

65 120248 AI924294 Hs. 1 73259 uncharacterized bone marrow protein BM033 4 12 

120253 AA131376 Hs.326401 fibroblast growth factor 12B ™f 

120269 AW131940 Hs.104030 ESTs 9.6 
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120274 AA177051 gb:nc02aQ2.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone IMAGE:194 similar to contains Aiu 4.6 

1 20280 AA1 90577 gb:zp52gOZs1 Stratagene HeLa cell s3 93721 6 Homo sapiens cDNA clone 3', mRNA sequence 2.0 

120296 AW99591 1 Hs.299883 ' hypothetical protein FU23399 1 .8 

120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195.HUMAN ZINC FINGER PROTEIN 195 [Rsaplensj 15.2 
5 120324 AA195517 Hs.191643 ESTs 5.5 

120325 AA195651 Hs.104106 ESTs 6.4 

120327 AK000292 Hs.278732 hypothetical protein FU20285 16.1 

120336 N85785 Hs.181165 eukaryotic translation elongation factor 1 alpha 1 2.9 

120342 AW450669 Hs.45068 hypothetical protein DKFZp434l1 43 5.7 

10 120345 AA210722 Hs,104158 ESTs 4.5 

120349 AW969481 Hs.55189 hypothetical protein 16.8 

120352 R06859 Hs.193172 ESTs, Weakly similar to 138022 hypothetical protein [Rsapiens] 5.0 

120356 AF000545 Hs.296433 putative purinerglc receptor ' 28.1 

* 120371 AA219305 Hs.104196 EST 12.4 

15 120382 AA228026 Hs.38774 ESTs 4,0 

120363 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homolog 1 9.7 

120386 AW969665 Hs.154848 hypothetical protein DKFZp434D0127 32.6 

120388 AA232874 Hs.104245 ESTs 3.1 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21.7 
20 120396 AA134006 Hs.79306 eukaryotic translation initiation factor 4E 12.5 

120404 AB023230 Hs.96427 KIAA1013 protein 7.2 

120418 AW966893 Hs.26613 Homo sapiens mRNA; cONA DKFZp586F 1323 (from clone DKFZp586F 1323) 11.4 

120423 AA236453 HS.1B978 Homo sapiens cDNA: FU22822 fis, clone KAIA3968 1.9 

120472 AI950087 gb.-wq05c02.x1 NCLCGAPJ<!d12 Homo sapiens cONA clone 3\ mRNA sequence 19,4 

25 120473 AA251973 Hs.269988 ESTs 5.4 

120484 AA253170 Hs.96473 EST 10.4 

120504 AA256837 gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:682387 3\ mRNA sequence. 3.9 

. 120509 BE047718 Hs.96545 ESTs 9.4 

120520 AA258601 Hs.161731 EST 2,4 

30 120535 BE350244 Hs.96547 ESTs 2.5 

120551 AA279160 Hs.111407 Homo sapiens, done IMAGE:3613029, mRNA, partial cds 5.2 

120570 AA280679 Hs.271445 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4 

120582 BE244830 Hs.284228 ZNF135-Bke protein 10.2 

120590 AW372799 Hs.125790 leucine-rich repeatontaining 2 2.1 

35 120596 AA282074 Hs.237323 . N^cetylglucosarnine-phosphate mutase 7.5 

120619 AW965339 Hs.111471 ESTs 2.5 

120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52.0 

120639 AA286942 gb:zs56fl)5.s1 NC1_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:70 1505 3' similar to contains Alu2.4 

120648 AA287095 Hs.140309 Homo sapiens, clone IMAGE:3677194, mRNA, partial cds 5.0 

40 120653 AW063659 Hs.191649 ESTs 2.2 

120668 AW969638 Hs.1 12318 6.2 kd protein 2.2 

120669 BE536739 Hs.109909 ESTs ' 1.9 

120695 AA976503 gb:oq30a04.s1 NCLCGAPJ3C4 Homo sapiens cONA clone 3' similar to contains PTR7.I1 PTR7 46,8 

120696 AI821539 Hs.97249 ESTs 2.5 
45 120713 AW449855 Hs.96557 Homo sapiens cDNAFLM 2727 lis, done NT2RP200Q027 5.9 

120718 AA292747 Hs.97296 ESTs 2.9 

120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B cell growth factor [H.sapiens] 7.0 

120774 AI608909 Hs.193985 ESTs 7.8 

120807 AA346385 Hs.30002 SH3-containkig protein SH3GLB2; KIAA1848 protein 6.8 

50 120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing O family repeat 4.4 

120938 AA386260 Hs.104632 EST 4.4 

120977 AA398155 Hs.97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.6 

120985 AI219896 Hs.97592 ESTs 1.2 
55 121011 AA398360 Hs.97608 EST 3.1 

121026 A1439713 Hs.165295 ESTs 3.5 

121081 AA398721 Hs. 186749 ESTs. Highly similar to 137550 mismatch repair protein MSH2[H .sapiens] 5.4 

121133 AA363307 Hs.97032 ESTs 3.7 

121176 AL121523 Hs.97774 ESTs 1.7 

60 121223 AI0021 10 Hs.97169 ESTs, Weakly similar to dJ667H 12. 2.1 {H.sapiens] 2.9 

121320 AA403008 Hs.301927 c6.1A 1.9 

121340 AW956981 Hs.97910 Homo sapiens cDNAFU13383fis, done PLACE1001024 3.5 

121408 AA406137 Hs.98019 EST 6.0 

121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-cell grov/th factor precursor [H.sapiensl 7.4 

65 121450 AA406430 Hs.105362 Homo sapiens, clone MGC: 18257, mRNA, complete cds 6.9 

121452 AW971063 Hs.292882 ESTs 1.8 

121455 H58306 Hs.15165 retinoic add induced 14 10.5 
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W07404 Hs.144502 hypothetical protein FU22055 

AA442224 Hs.9790Q ESTs 

AA494172 Hs.194417 ESTs 

AA402515 Hs.97887 ESTs 

AA416653 Hs.181510 ESTs 



3.4 

14.4 

13.1 

28.0 

6.2 



AA412112 

AA412477 Hs.98142 
AA412497 

AA411970 Hs.98098 
AA416568 

AO001528 Hs.89718 

AA626010 Hs.98247 



gb:zt69b0ls1 Soares_testis_NHT Homo sapiens cDNA clone JMAGE:727563 3\ mRNA sequence. 2.6 
EST 7.4 
gb:zt95g1Zs1 SoaresJestis.NHT Homo sapiens cDNA clone IMAGE:730150 3' similar to contalns2-8 



EST 



gb:zu05c10.s1 Soares_tesiis_NHT Homo sapiens cDNA clone 3\ mRNA sequence 
spermine synthase 
ESTs 

AA416931 Hs.126065 ESTs 

AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (from clone DKFZp434B1023) 

AA418160 Hs.86043 Homo sapiens cONA FU 13558 lis, clone PLACE 1007743 

AV660305 Hs.1 10286 ESTs 

U55184 Hs.154145 hypothetical protein FU11585 

AA419225 Hs.98269 Homo sapiens cDNA FU11953 fis, clone HEMBB1000883 

AI949597 Hs.98325 ESTs 

AA421041 Hs.180744 ESTs 

AA398784 Hs.97514 ESTs 

BE536911 Hs.234545 hypothetical protein NUF2R 

AB033022 Hs.158654 KIAA1 196 protein 

AA421773 Hs.161003 ESTs 

AA292579 Hs.125133 hypothetical protein FU22501 

A1810774 Hs.98376 ESTs 

AW340797 Hs.98434 ESTs 

AA328348 Hs.218289 ESTs 

AA425691 Hs.191606 ESTs, Highly similar to KIAA1 048 protein [H.sapiens] 
AF027406 Hs.104865 serine/threonine kinase 23 
AA446628 Hs.2799 cartilage finking protein 1 
AW972668 Hs.293044 ESTs 
AA426376 Hs.98459 ESTs 

AA427950 gb:zw50f02.s1 Soares_totalJetus^Nb2HF8„9w Homo sapiens cDNA clone IMAGE:773499 3' 

AA428179 Hs.223405 ESTs, Moderately similar to A46010 X-finked retinopathy protein [H.sapiens] 

AA428647 Hs.98611 EST 

AA298760 Hs.180191 hypothetical protein FU 14904 

AI862570 Hs.299214 Homo sapiens, done IMAGE:2822295, mRNA, partial cds 

AA210863 Hs.3532 nemo-like kinase 

AA430211 Hs.98668 EST 

AW292763 Hs.160822 Homo sapiens cDNA: RJ20863 fis, clone ADKA0 1804 
AA431085 Hs.98706 ESTs 

W92142 Hs.271963 ESTs. Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 
AI453076 Hs.166109 ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 
AA431738 Hs.98750 EST 
AW161023 Hs.104921 ESTs 

AA398838 gb:ztB0d01 s\ Soares_testis_NHT Homo sapiens cDNA clone 5\ mRNA sequence 

AA435936 Hs.98842 EST 

AA329550 Hs.29417 HCF-binding transcription factor Zhangfei 
AA436819 Hs.98899 ESTs 
AA441801 Hs.104947 ESTs 

AW601969 Hs.99010 hypoftellcal protein FU22263 suwlar to nudear receptor^hding SET-domain protdn 1 . 
AA443794 Hs.98390 ESTs 
AA443985 Hs.303222 ESTs 
AA868555 Hs.178222 ESTs 
AA446008 Hs.336677 EST 

AB032948 Hs.21356 hypolheticaJ protein DKF2p762K201 5 
AA446572 Hs.303223 EST 
AA446869 Hs.1 19316 ESTs 
AA446918 Hs.99088 EST 

AA446966 Hs.99090 ESTs, Moderately similar to similar to KIAA0766[H .sapiens] 

AW505139 Hs.9460 Homo sapiens mRNA; cDNA OKFZp547C244 (from clone DKFZp547C244) 

AA447603 Hs.99123 EST 

AA447626 Hs.99127 EST 

AI266159 Hs.104980 ESTs 

AW418788 Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 protein - Caenomabditis elegans [Celegans] 
AA448158 Hs.99152 EST 
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I. 7 
6.6 
10.5 
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5.0 
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2.3 
2.9 
5.0 
72 
2.5 
2.3 
3.4 

II. 4 
3.8 
6.4 
2.2 
6.5 
13.1 
9.1 
13.1 
1.5 
3.3 
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2.0 
7.3 
12.2 
5.0 
7.6 
2.5 
2.8 
7.3 
1.9 
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1.8 
3.5 
1.5 
9.7 
4.8 
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M448349 Hs.238151 EST 
AA446417 Hs.104990 ESTs 

AA204969 Hs.234863 Homo sapiens cDNA FU12082 fis, clone HEMBB1Q02492 
AA449232 Hs.99195 ESTs 

AW959741 Hs.40368 adaptor-related protein complex 1, sigma 2 subunit 
AA779725 Hs.164589 ESTs 
AA194055 Hs.293858 ESTs 
AA452578 Hs.262907 ESTs 
AA452601 Hs.99287 EST 

Homo sapiens cDNA FU1 1048 fis, done PLACE10Q4516 
KIAA146Q protein 
ESTs 

hypothetical protein FU23588 



AK001910 Hs.99303 
AB040893 Hs.6968 
AI028173 Hs.99329 
AL355841 Hs.99330 
AA411925 Hs.301960 ESTs 
AA453518 Hs.98023 ESTs 
M453630 Hs.99339 EST 
AA453638 Hs.161873 ESTs 
AI681535 Hs.148135 serine/lhreonine kinase 33 

AA453641 gb:zx48e06.s1 Soares JestisJJHT Homo sapiens cONA clone 3\ mRNA sequence 

AA453987 Hs.144802 ESTs 
AA456859 Hs.178358 ESTs 
AI376875 Hs.105119 ESTs 
AW204530 Hs.99500 ESTs 

AA461492 Hs.99545 Homo sapiens cDNAFU10658fc, clone NT2RP2006052 
AA460581 Hs.290996 ESTs 

AA461509 Hs.293565 ESTs, WeaWy similar to putative p1 50 [H.sapiensJ 
AA460584 Hs.334386 ESTs 

AA600235 Hs.9625 NIMA (never in mitosis gene a)-related kinase 6 
AI929374 Hs.75367 Src-like-adapter 
AA335721 Hs.1 19394 ESTs 
BE539656 Hs.283705 ESTs 

AF005216 Hs.1 15541 Janus kinase 2 (a protein tyrosine kinase) 
AW576312 Hs.318722 Homo sapiens cDNA: FLJ21766 fis, clone COLF7 179 
AW061394 Hs.97103 ESTs 
AA769410 Hs.128654 ESTs 
AA470074 Hs.169896 ESTs 
AA470140 Hs.229170 EST 
AA478951 Hs.105629 ESTs 
AW968324 Hs.17384 ESTs 

AW338067 Hs.323231 Homo sapiens cONAFUI.1946 fis. done HEMBB1000709 
AL359571 Hs.44054 ninein (GSK3B Interacting protein) 
AI382600' Hs.104308 ESTs, Weakly similar to KIAA1395 protein [H.sapiens] 
AA485360 Hs.105661 ESTs 
AI343652 Hs.105667 ESTs 
AA486256 Hs.193510 EST 
BE304942 Hs.265848 myomegalin 

T52027 Hs .271795 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiensj 
AI061582 Hs.324179 Homo sapiens cDNA FU12371 fis, done MAMMA1002434 
AW451999 Hs.194024 ESTs 
AI734179 HS.105676 ESTs 
AW601773 Hs.270259 ESTs 

AA490929 Hs.105274 ESTs, Weakly similar to RMS1 .HUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens] 
AA496369 gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:755827 3* similar to4.1 

AA504757 Hs. 105738 ESTs 6.9 
AA731404 Hs.105510 ESTs 3.6 
AW450922 Hs.1 12478 ESTs 3.7 
AA599042 Hs.112503 EST 7.4 
AW303285 Hs.303632 Human DNA sequence from done RP11-110H4 on chromosome 5 Contains a pseudogene similar to 3.5 
AB021644 Hs.197219 zinc finger protein 14 (KOX 6) 5.2 
BE439553 Hs.250528 Homo sapiens, done IMAGE:4098694, mRNA, partial cds 1.7 
N95059 Hs.55098 ESTs 1.6 
BE019072 Hs.334802 Homo sapiens cDNA FU14680 fis. done NT2RP2004242, weakly similar to 2.4 
AW380388 Hs.155546 K1AA1080 protein; Golgi-assodated, gamma-adaptin ear containing, ARF-binding protein 2 2.2 
AA609170 gbaf12a12.s1 Soares_testis__NHT Homo sapiens cDNA done 3', mRNA sequence 7.8 

AA602964 gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens cDNA done, mRNA sequence 18 

AA609364 — - . 
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II. 0 
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I. 3 
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II. 5 
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2.4 
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9.3 



1.7 



A1269609 Hs.105187 k'mesin protein 9 gene 
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123735 NMJJ13241HS.95231 FH1/FH2 domain-containing protein 10.0 

123738 AA609891 Hs.112777 EST 5.2 

123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6 

123804 AA620464 Hs.261915 EST, Weakly similar to S65557 a!pha-1C-adrenergic receptor splice form 2 [H.sapiensj 2.1 
5 12381 1 AA620586 gb:ae60g05.s1 Stratagene rung carcinoma 937218 Homo sapiens cDNA clone IMAGE:951320 3' Z7 

123951 AB012922 Hs.173043 metastasis-associated 1-Eke 1 6.2 

123983 AJ272267 Hs.146178 choline dehydrogenase 4.4 

124001 L42542 Hs.75447 raiA binding protein 1 7.0 

124006 AI147155 Hs.270016 ESTs 8.1 

10 124070 AI950314 Hs.1 54762 HIV-1 rev binding protein 2 3.7 

124074 H05635 Hs.294030 topoisomerase-related function protein 4-2 1.2 

124178 BE463721 Hs.97101 putative G protein-coupled receptor 3.1 

124203 AA372796 Hs.269339 ESTs, Weakly similar to AF161356 1 HSPC093 [Rsapiens] 5.7 

124352 AA640891 Hs.102406 ESTs 3-1 

15 124375 D87454 Hs.192966 KIAA0265 protein 33 

124385 AI267847 gb:aq49a10J(1 Stanley Frontal NB pool 2 Homo sapiens cDNA clone similar to contains 57.1 

124390 AA317338 Hs.7535 COBW-Bke protein 2.8 

124391 AF155099 Hs.279780 NY-REN-18 antigen 7.1 
124417 N34059 gb:yv28h09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3* similar to contains Alu 3.3 

20 124428 H13540 Hs.82202 ribosomal prote^ L17 2.9 

124440 AA532519 Hs.129043 Human ONA sequence from clone 989H11 on chromosome 22q13.1-13.2. Contains part of a 7,8 

124466 R10084 Hs.1 13319 Hnesin heavy chain member 2 2.6 
124482 N53935 gb:yv59d09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 3', mRNA sequence 7.9 

124498 H79433 Hs.268997 ESTs 7.8 

25 124515 AA669097 Hs.109370 ESTs 3.3 

124608 N71076 Hs.102800 ESTs, Weakly similar to neuronal thread protein AD7C-NTP [H.sapiens] 4.5 

124631 NMJ)14053Hs.270594 FLVCR protein 3.2 

124634 A/765123 Hs. 143671 Homo sapiens cDNA FU13533 fis, done PLACE 1006371 5.8 

124637 AA160474 Hs.75798 hypothetical protein 9.3 

30 124642 AW968856 Hs.278569 sorting nexin 17 3.5 

124649 N92593 Hs.313054 ESTs 6.1 

124656 AW297702 Hs.102915 ESTs 8 3 

124661 R48170 Hs.78436 EphB1 5.6 

124683 AA381661 Hs.119878 ESTs, Weakly similar to M3K9_HUMAN MFTOGEN-ACTIVATED PROTEIN KINASE KINASE 7.9 

35 124712 R09166 Hs.191148 ESTs 5.7 

124735 R22952 Hs.268685 ESTs 11-3 

124761 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1771 protein, partial cds 9.0 

124768 AW368528 Hs.100855 ESTs 81 

124775 R41772 Hs.100878 ESTs 4.9 

40 124777 R41933 Hs.140237 ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE 2.8 

124788 R43543 Hs.100912 Homo sapiens cDNA: FU22726 fis. done HS1 15005 5.1 

124809 AL355722 Hs.106875 Homo sapiens EST from done 35214, full insert 4.2 

124811 R46068 Hs.28891 2 hypothetical protein FU22604 14.2 

124812 R47948 Hs.188732 ESTs 7.9 
45 124822 AA418160 Hs.86043 Homo sapiens cDNAFU! 3558 fis. done PLACE 1007743 6.6 

124825 AA501669 Hs.336693 ESTs 2.3 

124833 AW975868 Hs.294100 ESTs 2.7 

124857 R63652 Hs.137190 ESTs 2.3 

124860 R65763 Hs.101477 EST 23.9 

50 124863 AI382555 Hs. 1 27950 bromodomain-containing 1 2.0 

124876 AF135422 Hs.27059 GOP-mannosepyrophosphorylaseA 4.4 

124878 BE397530 Hs.288057 hypothetical protein FU22242 2.7 

124902 H37941 Hs.101883 ESTs 5.7 

124903 AW296713 Hs.221441 ESTs 32.4 
55 124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN Ull ALU CLASS B WARNING ENTRY HI [H.sapiens] 22.8 

124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypothetical protein [H.sapiens] 6.1 

124958 A1078645 Hs.431 murine leukemia viral (bmi-1) oncogene homolog 1.9 

124980 T40841 Hs.98681 ESTs 4.5 

125002 T59338 Hs.269463 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 4.9 

60 125047 T79815 Hs.279793 ESTs 5.0 

125051 T79956 Hs.100588 EST 135.3 

125056 T81310 Hs.100592 ESTs 5.4 

125101 A1472068 Hs.286236 KJAA1856 protein 5.6 

125113 T96595 Hs.302270 ESTs, Weakly similar to ALUFJiUMAN HI! ALU CLASS F WARNING ENTRY II! [H.sapiens] 1.8 

65 1251 15 T97341 gb:ye57e05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:121856 3" simllarto 9.6 

125125 AI222382 Hs.240767 Hiirr^DrMser^eru»fromdoneRP1-12G14oncto^ 1.5 

125147 W38150 Empiricafly selected from AFFX single probeset 1.7 
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125161 W44657 Hs.144232 EST 10.7 
125249 AA630863 Hs.131375 ESTs, Moderately similar to ALUBJiUMAN Hi! ALU CLASS B WARNING ENTRY 111 [H.sapiens] 1.3 

125255 AF098162 Hs.118631 timeless (Drosophila) homolog 9.4 

125279 AW401809 Hs.4779 KIAA1 150 protein 1.5 

5 125280 AI123705 Hs.106932 ESTs 8.0 

125298 AW972542 Hs.289008 Homo sapiens cONA: FU21814 fis, clone HEP01068 1,5 

125660 AW292171 Hs.23978 scaffold attachment factor 5 5.9 

125827 NMJ503403HS.97496 YY1 transcription factor 1.2 

125891 U29589 Hs.7138 cholinergic receptor, muscarinic 3 6.4 

10 126005 AW409701 Hs.1578 baculovirai 1AP repeat-containing 5 (survivin) 14.3 

126202 AA1 57632 Hs.272630 vacuolar proton pump delta polypeptide 2.4 

126695 AA643322 Hs.172028 a disintegrin and metalloproteinase domain 10 9.1 

127050 AW411066 Hs.274351 CGI-89 protein 17.0 

127274 AW966158 Hs.58582 Homo sapiens cONAFU12789fis. done NT2RP2001 947 12.8 

15 128355 AW293012 Hs.161623 ESTs 7.3 

128493 D87466 Hs.240112 K1AA0276 protein 3.1 

128493 D87466 Hs.240112 KIAA0276 protein - 1.3 

128522 BE173977 Hs.10098 putative nucleolar RNA heOcase 9.4 

128527 AA504583 Hs.1 01047 transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47) 1.5 

20 128528 R39234 Hs.251699 ESTs, Weakly similar to ION4-GGTR14 (H.sapiens] 2,8 

128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member 12.1 

128599 NM_015366Hs.102336 Rho GTPase activating protein 8 2.3 

128604 AI879099 Hs.102397 GIOT^ for gonadotropin Indudble transcripabn repressor.3 . 1.3 

128608 BE267994 Hs.102419 zinc finger protein 7.1 

25 128625 AB037841 Hs.102652 hypothetical protein ASH1 1.3 

128629 AL096748 Hs.102708 DKFZP434A043 protein 3.2 

128639 AW582962 Hs.102B97 CGI-47 protein 2.0 

128656 AA458542 Hs.10326 coatomer protein complex, subunitepsilon 1.4 

128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilo 1.3 

30 128658 BE397354 Hs.324830 diptheria toxin resistance protein required for diphthamide biosynthesis (Saccharomyces)-like 2 2.4 
128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKEN cONA 1700010L19 gene, clone MGC:16214, mRNA, complete cds 7.1 

128691 W27939 Hs.103834 hypothetical protein MGC5576 7.7 

128696 BE081143 Hs.225977 nuclear receptor coactivator 3 3.8 

128700 Y15221 Hs.103982 small Inducible cytokine subfamily B (Cys-X-Cys), member 1 1 1.6 

35 128714 T85231 Hs.179661 tubulin, beta 5 7 A 

128717 AK001564 Hs.104222 hypothetical protein FU 10702 5.5 

128733 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypothetical protein [H.sapiens] 2.7 

128737 AF292100 Hs.104613 RP42 homolog 2.8 

128742 AA307211 Hs.251531 proteasome (prosome, macropaln) subunit, alpha type, 4 4,4 

40 128746 AI470163 Hs.323342 actin related protein 2/3 complex, subunit 4 (20 kD) 2.2 

128747 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated protein kinase 2.8 

128772 BE302796 Hs.105097 thymidine kinase 1, soluble 5.3 

128781 N71826 Hs.105465 small nuclear ribonudeoprotein polypeptide F 53.9 

128797 NMJQ2975HS.105927 stem cell growth factor; lymphocyte seaetedC-type lectin 13.3 

45 128805 AW630942 Hs.106061 RD RNA-blnding protein 2.6 

128814 AW248431 Hs.256526 nuclear prelamin A recognition factor 2.2 

128830 BE281170 Hs.106357 valosin-oontaining protein 5.9 

128835 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from clone DKFZp586H0924) 1.6 

128854 BE159181 Hs.168232 hypothetical protein FU 13855 2.2 

50 128854 BE159181 Hs.168232 hypothetical protein aj 13855 1.9 

128868 AA4 19008 Hs. 1 06730 chromosome 22 open reading frame 3 3.0 

128868 AA4 19008 Hs. 106730 chromosome 22 open reading frame 3 22 

128871 AF189723 Hs.1 06778 ATPase,Ca++ transporting, type 2C, member 1 1.5 

128891 F34856 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, complete cds 13.3 

55 128906 R57988 Hs.10706 epithelial protein lost in neoplasm beta 4.7 

128920 AA622037 Hs.166468 programmed cell death 5 14 

128925 R67419 Hs.21851 Homo sapiens cONA FU12900 fis, done NT2RP2004321 1.9 

128946 Y13153 Hs.1 0731 8 kynurenine 3*monooxygenase (kynurenine 3-hydroxylase) 7.2 

128949 AA009647 Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha) (ADAM-12) 2.4 

60 128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 1.3 

128959 AI580127 Hs.107381 hypothetical protein RJ 11 200 10.9 

128965 AW150697 Hs.107418 ESTs 1.4 

128970 AI375672 Hs.165028 ESTs 1.3 

128975 BE560779 Hs.284233 NICE-5 protein 14.0 

65 128979 AW271217 Hs.281434 Homo sapiens cDNA FU14028 fis, done HEMBA1003838 16 

128995 A1816224 Hs.107747 OKFZP566C243 protein 19 

129019 A1950087 gb:wq05cG2.x1 NCLCGAP Kid12 Homo sapiens cDNA done 3'. mRNA sequence 2.9 
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129021 AL044675 Hs.173081 KIAA0530 protein 3.8 

129021 AL044675 Hs.173081 KIAA0530 protein 2.5 

129032 R80088 Hs.108104 ubiquitin-conjugating enzyme E2L 3 3.4 

129076 AW296806 Hs.326234 ESTs. Highly similar to T46422 hypothetical protein DKFZp434M2023.1 [H.sapiens] 5.0 

5 129078 AI351010 Hs.102267 lysosomal 2.1 

129088 AA744610 Hs.194431 paBadin 17.1 

129095 112350 Hs.108623 thrombospondin 2 2.7 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

129097 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 3.0 
10 129099 AF146074 Hs.108660 ATP-binding cassette, sub-family C(CFTR/MRP), member 5 5.8 

129135 W93048 Hs,250723 hypothetical protein MGC2747 55 

129149 AA356620 Hs.108947 K1AA0050 gene product 6.3 

129172 AW162916 Hs.241576 hypothetical protein PR02577 1.8 

129192 AA286914 Hs.183299 ESTs 2.1 

15 129194 AA150797 Hs.109276 latexin protein 3.2 

129198 N57532 Hs.109315 KIAA1415 protein 5.8 

129207 AI934365 Hs.109439 osteoglycin (osteoinductive factor, mimecan) 8.0 

129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 2.9 

129229 AF013758 Hs.109643 pdyadenylate binding protein-interacting protein 1 3.2 
20 129254 AA252468 Hs.1098 DKFZp434J 1813 protein 2.6 

129255 A1961727 Hs.109804 H1 histone family, member X 7.3 

129288 W26392 Hs.110080 ESTs, Weakly similar to S13495 pregnancy zone protein (H.sapiens) 9,6 

129296 AI051967 Hs.1 10122 ESTs 1.2 

129323 AA287239 Hs.5518 Homo sapiens cDNAFU 11 31 1 fe, done PLACE 101 01 02 5.1 

25 129340 H75334 Hs.11050 F-box only protein 9 4.6 

129347 BE614192 Hs.279869 melanoma-associated antigen recognised by cytotoxic T lymphocytes 7.6 

129362 U30246 Hs.1 10736 solute carrier family 12 (sodium/potassium/chloride transporters), member 2 6.7 

129366 BE220806 Hs.1 84697 Homo sapiens clone 23785 mRNA sequence 8.6 

129370 A1686379 Hs.1 10796 SAR1 protein 14 

30 129372 NM_016039Hs.1 10803 CGI-99 protein 2.0 

129403 AF149785 Hs.1 11126 pituitary tumor-transforming 1 interacting protein 7.4 

129404 AI267700 Hs.317584 ESTs 5.0 
129404 AI267700 Hs.317584 ESTs 2.5 
129423 AA204686 Hs.234149 hypothetical protein FU20647 10.2 

35 129449 AI096988 Hs.111554 ADP-ribosylation factor-tike 7 8.0 

129453 AW974265 Hs.1 11632 Lsm3 protein 3,2 

129482 AA188185 Hs.289043 spindOn 6.7 

129482 M188185 Hs.289043 spindfin 3.6 

129513 AW843633 Hs.306163 hypothetical protein AM 101 1 5 7.1 

40 129515 AF255303 Hs.1 12227 membrane-associated nudeic acid binding protein 2,5 

129527 AA769221 Hs.270847 delta-tubuiin ' 3.2 

129559 W01296 Hs.1 1360 hypothetical protein FU 14784 7.5 

129560 AA317841 Hs.7845 . hypothetical protein MGC2752 6.8 
129570 AI923097 Hs.11441 chromosome 1 open reading frame 8 2.0 

45 129575 F08282 Hs.278428 progestin induced protein 1.6 

129587 H14718 Hs.1 1506 Human done 23589 mRNA sequence 6.8 

129588 BE408300 Hs.301862 postmeiotic segregation increased 2-tike 9 1.4 
129591 N57423 Hs.179898 HSPC055 protein 7.3 
1 29594 AW403724 Hs.36989 coagulation factor VII (serum prothrombin conversion accelerator) 9.0 

50' 129596 AF035537 Hs.115521 REV3 (yeast homoiog)-nke, catalytic subunit of DNA polymerase zeta 1.6 

129628 U38945 Hs.1174 cycDn-dependent kinase inhibitor 2A (melanoma, p1 6, inhibits COK4) 2.2 

129628 U38945 Hs.1174 cydin^ependent kinase inhibitor 2A (me 1.4 

129629 AKQ00398 Hs.11747 hypothetical protein FU20391 3.8 
129649 AD000092 Hs.16488 calreficufin 3.3 

55 129675 NIVL015556HS.172180 KIAA0440 protein 13.4 

129680 U03749 gb:Human chromogranin A (CHGA) gene, promoter an 14.1 

129689 AW748482 Hs.77873 B7homotog3 2.6 

129702 A1304966 Hs.12035 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 7.4 

129720 AA156214 Hs.12152 APMCF1 protein ' 2.0 

60 129721 NMJ)01415Hs.211539 eukaryofic tola^ initiafion factor 2, subunit 3 (gamma, 52x0) 1.7 

129726 H15474 Hs.132898 fatty add desatorase 1 8.3 

129778 AK001676 Hs.12457 hypothetical protein FU 10814 1.8 

129779 AA394090 Hs.12460 Homo sapiens done 23870 mRNA sequence 5.4 
129800 AF052112 Hs.12540 lysosomal 1.7 

65 129806 AB023148 Hs.173373 K1AA0931 protein 1.2 

129815 BE565817 Hs.26498 hypothetical protein FU21 657 3.1 

129840 NM_006590Hs.12820 SnRNP assembly defective 1 homolog 1.8 
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AL049999 Hs.85963 OKFZP564M1 82 protein 

AI393237 Hs.129914 runt-related transcription factor 1 (acute myeloid leukemia 1; aml1 oncogene) 

AI222069 Hs.13015 hypothetical protein similar to mouse DnaJH 

AF042379 Hs. 13386 gamma-tubulin complex protein 2 

BE514376 Hs.165998 PAM mRNA-bindlng protein 

AA412195 Hs.13740 ESTs 

AW753185 Hs.180628 dynamin Mike 

U09848 Hs.132390 zinc finger protein 36 (KOX 18) 

AB015856 Hs.247433 activating transcription factor 6 

AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 

AA287325 Hs.14713 ESTs 

S73265 Hs.1473 gastrin-releasing peptide 

AL046962 Hs.14845 forkhead box 03A 

AL135561 Hs.14891 hypothetical protein FU21047 

X53002 Hs.149846 integrin, beta 5 

AA916785 Hs. 180610 splicing factor proline/glutamlne rich (polypyrimidine tract-binding protein-associated) 

AA916785 Hs.1 80610 splicing factor proline^lutamine rich ( 

L76937 Hs.150477 Werner syndrome 

AA311426 Hs.21635 tubulin, gamma 1 

NM_003358Hs.23703 ESTs, Moderately similar to CEGTJWMAN CERAMIDE GLUCOSYLTRANSFERASE 

D80001 Hs.152629 KIAA01 79 protein 

R85367 Hs.51957 splicing factor, arginine/serine-fich 2, interacting protein 

AL035588 Hs.153203 MyoD family inhibitor 

X79201 Hs.153221 synovial sarcoma, translocated to X chromosome 

D81983 Hs.322852 GAS2-related on chromosome 22 
NM_002497Hs.153704 NIMA (never in mitosis gene a)-related kinase 2 

AA479005 Hs. 154036 tumor suppressing subtransferable candidate 3 

AB01 1 121 Hs.1 54248 amyotrophic lateral sclerosis 2 (Juvenile) chromosome region, candidate 3 

Z19084 Hs.172210 MUF1 protein 

AF127577 Hs. 155017 nuclear receptor Interacting protein 1 

AJ224442 Hs.155020 putative methyltransferase 
NM_01 3449HS.277401 bromodomain adjacent to rinc finger domain, 2A 

AL135301 Hs.8768 hypothetical protein FU 10849 

AI077464 Hs.5011 RNA binding motif protein 9 

N89487 Hs,155291 K1AA0005 gene product 

AW374106 Hs.155356 hypothetical protein MGC2840 similar to a putative glucosyltransferase 

BE385099 Hs.334727 hypothetical protein MGC3017 

NMJHJ1 197Hs. 155419 BCL2-interacting killer (apoptosis-inducing) 

AF037448 Hs.155489 NS1 -associated protein 1 

U63630 Hs.155637 protein kinase, DNA-acuVated, catalytic polypeptide 

BE513202 Hs.15589 PPAR binding protein 

D90041 Hs.1 55956 N-acetyltransf erase 1 (arylamine N-acetyltransferase) 

D90041 Hs.1 55956 N-acetyltransferase 1 (arylamine N-acety 

AL121438 Hs.183706 adducin 1 (alpha) 

BE245851 Hs.180779 H2B histone family, member B 

U49844 Hs.77613 ataxia telangiectasia and Rad3 related 

L38951 Hs.180446 karyopherln (importin) beta 1 

BE208491 Hs.295112 KIAA0618 gene product 

L32137 Hs.1 584 cartilage oligomeric matrix protein (pseudoachondroplasia, epiphyseal dysplasia 1 , multiple) 

L32137 Hs.1 584 cartilage oligomeric matrix protein (pse 

AW876523 Hs.15929 hypothetical protein FU12910 

U64675 Hs.179825 RAN binding protein 2-like 1 

AA321238 Hs.4310 eukaryotic translation Initiation factor 1 A 

AF062649 Hs.252587 pituitary tumor-transforming 1 

AI907018 Hs.15977 Empirically selected from AFFX single probeset 

AA383092 Hs.1608 replication protein A3 (14kD) 

AA232119 Hs.16085 putative G-protein coupled receptor 

AF083208 Hs.1 6178 apoptosis antagonizing transcription factor 

AB007891 Hs.16349 KIAA04 31 protein 

AL042210 Hs.16493 hypothetical protein DKFZp762N2316; KIM1803 protein 

AA609738 Hs.16525 ESTs 

A1354355 Hs.1 6697 down-regulator of transcription 1, TBP-binding (negative cofactor 2) 

M90516 Hs.1674 glutamlne-mjctose-6-phosphate transaminase 1 

M9D516 Hs.1674 glutamine-fructose-6-phosphatetransamin 

AA383439 Hs.16758 Splr-1 protein 

6E246961 Hs.1 7639 Homo sapiens ubiquitin protein ligase (UBE3B) rnRNA, partial cds 

AL048842 Hs,194019 attractin 
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130675 AA442233 Hs.17731 hypothetical protein FU1 2892 54 

130692 AA652501 Hs.13561 hypothetical protein MGC4692 5 0 

130693 R68537 Hs.17962 ESTs 20 
130712 AJ271881 Hs.279762 bromodomain-containing 7 1*8 

5 130714 A1348274 Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence 2 0 

130730 AB007920 Hs.18586 K1AA0451 gene product 37 
130744 H5969S HsA8747 POP7 (processing of precursor. S. cerevisiae) homofog 11 
130751 AF052105 Hs.18879 chromosome 12 open reading frame 14 
130757 AUJ36067 Hs.18925 protein x 0001 57 

10 130768 AF258627 Hs.211562 ATP-binding cassette, sub-family A (ABC1), member 1 51 
130789 AKDG0355 Hs.8899 sirtuin (silent mating type InformaOon regulation 2, Sxerevisiae, homolog) 5 5^ 
130815 AB018298 Hs.19822 SEC24 (S. cerevisiae) related gene family, member O 15 
130836 J05068 Hs.2012 transcobalamln I (vitamin B12 binding protein, R binder family) 157 

130841 AL157468 Hs.325825 Homo sapiens cDNA FU2084B fis, done ADKA0 1732 28 

15 130843 AA447492 Hs.20183 ESTs, Weakly similar to AF1 64793 1 protein x 013 [H. sapiens] 15 
130844 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 34 
130855 AJ243706 Hs.143323 putative DMA/chromatin binding motif 17 
130861 NM_016578Hs.20509 H6V pX associated protein-8 1*9 
130879 NM_003416Hs.2076 zincfingerproteln7(KOX4,doneHF.16) 14 

20 130880 BE514434 Hs.20830 (dnesirv^ke 2 21 

130892 AL120837 Hs.20993 high -glucose-regulated protein 8 24 
130898 AB033078 Hs.186613 sphingosine-1 -phosphate lyase 1 17 

130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, member 2 18 

130919 N79110 Hs.21276 collagen, type IV. alpha 3 (Goodpasture antigen) binding protein Z3 

2,5 130944 BE382657 Hs.21486 signal transducer and activator of transcription 1,91 kD 54 

130971 N39842 Hs.301444 KIAA1673 2 2 

130992 BE398091 Hs.74316 desmoplakin (DP), DPI)) 18 

130993 T97401 Hs.21929 ESTs j'g 
131005 AV658308 Hs.2210 thyroid hormone receptor interactor 3 16 

30 131028 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), gamma 12 

131042 A1826288 Hs. 171637 hypothetical protein MGC2628 1 6 

131046 AA321649 Hs.2248 sman inducible cytokine subfamily B (Cys-X-Cys), member 10 7.4 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B(Cy 3 0 

131047 H23230 Hs.22481 ESTs. Moderately similar to A46010 X-linked retinopathy protein JRsapiens] 1.7 
35 131060 AA194422 Hs.22564 myosin VI 51 

131060 AA194422 Hs.22564 myosin VI 2*5 

131070 N53344 Hs.22607 ESTs y\\ 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP^-acetylgiucosamine) N-acetylglucosaminephosphotransferase 1 ( 2.0 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosam 19 

40 131099 AL133353 Hs.226581 COX1 5 (yeast) homolog. cytochrome c oxidase assembly protein 70 

131174 NMJ)06540Hs.29131 nuclear receptor coactivator 2 19 

131185 BE280O74 Hs.23960 cydinBI 58 

131206 AW138839 Hs.24210 ESTs 2 0 

131213 AA885699 Hs.24332 CG1-26 protein 7 0 

45 131225 H62087 Hs.31659 thyroid hormone receptor-associated protein, 95-kD subunit 75 

131231 N47468 Hs.59757 zinc finger protein 281 29 

131233 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 3 3 5 

131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 2 8 

131245 AL080080 Hs.24766 thbredoxin domairnxxitaining 2 8 

50 131247 AL043100 Hs.326190 fatty acid amkte hydrolase 56 

131281 AA251716 Hs.25227 ESTs 5 7 

131283 X80038 Hs.339713 Homo sapiens done F19374 APO E-C2 gene cluster 13 

131305 AV656017 Hs.184325 CGl-76 protein 5 0 

131320 AA505691 Hs.145696 spficing factor (CCU) {& 

55 . 131339 AF058695 Hs.25812 Nijmeg en breakage syndrome 1 (nibrin) 26 

131339 AF058696 HS25812 Nijmegen breakage syndrome 1 (nibrfn) 26 

131375 AW293165 Hs.143134 ESTs 54 

131390 BE269388 Hs.182698 mitochondrial ribosomal protein U0 53 

131410 BE259110 Hs.279836 HSPC166 protein 2 2 

60 131412 NM_012247Hs.1 24027 SELENOPHOSPHATE SYNTHETASE ; Human selenium donor protein 20 

131429 AL046302 Hs.26750 hypothetical protein FU21 908 14 

131458 BE297567 Hs.27047 hypothetical protein FU20392 1*7 

131475 AA992841 Hs.27263 K1AA1458 protein 2 0 

131501 AV661958 Hs.8207 GK001 protein 26 

65 131501 AV661958 Hs.8207 GKflOl protein 16 

131511 AA732153 Hs.27865 Homo sapiens cDNA: FU21333 fis, ctone COL02535 2 0 

131528 AU076408 Hs.28309 UOP-glucose dehydrogenase 1.6 
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BE268278 Hs.28393 
AW966881 Hs.41639 
AL355715 Hs.28555 
NMJW3512HS.28777 
T93500 Hs.28792 
T93500 Hs.28792 
AL389951 Hs.271623 
BE393822 Hs.29645 
R78195 Hs.29692 
AB037791 Hs.29716 
AB037791 Hs.29716 
AW410601 Hs.30026 
AW960597 Hs.30164 
AI218918 Hs.30209 
X52486 Hs.3041 
BE559681 Hs.3Q736 
AA642831 Hs.31016 
013757 Hs.311 
AK001641 Hs.31323 
X76732 Hs.3164 
X76732 Hs.3164 
AI878932 Hs.317 
AA382590 Hs.170980 
BE267158 Hs.169474 
D87077 Hs.196275 
AW966127 Hs.32246 
BE501849 Hs.32317 
X86098 Hs.301449 
U20536 Hs.3280 
U28838 Hs.32935 
AI251317 Hs.33184 
AA083764 Hs.6101 
BE502341 Hs.3402 
BE502341 Hs.3402 
W17084 Hs.332848 
AA099014 Hs.231029 
AA099014 Hs.231029 
AF078866 Hs.28429$ 
AA179298 Hs.3439 
AW207440 Hs.185973 
AA025976 Hs.34569 
AF151048 Hs.183180 
BE541211 Hs.34804 
BE252983 Hs.35086 
AA355113 Hs.35380 
AK000046 Hs.267448 
W79283 Hs.35962 
BE567100 Hs.154938 
U90441 Hs.3622 
AA503020 Hs.36563 
AF053306 Hs.36708 
H56995 Hs.37372 
AF193844 Hs.3758 
BE266155 Hs.3832 
NM_002267Hs.3886 
BE171921 Hs.3991 
AV646076 Hs.39959 
AW960474 Hs.40289 
AA857025 Hs.8878 
NM_00446QHs.418 
AA206153 Hs.4209 
R42432 Hs.4212 
NM.004782HS.194714 
BE206939 Hs.42287 
AV658411 Hs.42656 
AB018324 Hs.42676 
AI566004 Hs.141269 



hypothetical protein MGC2592 
programmed ceil death 2 



H2A histone family, member L 
Homo sapiens cONA FU1 1041 lis, clone PLACE1004405 
Homo sapiens cONA FU11041 fis, clone PL 
nudeoporinSOkD 

Homo sapiens mRNA; cONA DKFZp761C029 (from clone DKFZp761C029); partial cds 

Homo sapiens cONA FU11436 fis. clone HEMBA1001213 

hypothetical protein FU 10980 

hypothetical protein FU10980 

HSPC182 protein 

ESTs 

KIAA0854 protein 

uracil-DNAglycosylase 2 

KIAA0124 protein 

putative ONA binding protein 

phosphoribosyt pyrophosphate amidotransf erase 

inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated protein 
nucleobindin 2 
nucleoblndin 2 



KtAA0948 protein 
DKFZP586J01 19 protein 
KIAA024Q protein 

Homo sapiens cONA RJ14656 fis, done NT2RP2002439 

high-mobility group 208 

adenovirus 5 E1A binding protein 

caspase 6, apoptosis-related cysteine protease 

TATA box binding protein (TBP)-assodated factor, RNA polymerase III, GTF3B subunit 2 
ESTs 

hypothetical protein MGC3178 

ESTs 

ESTs 



7.4 

2.2 

2.1 

1.7 

5.1 

1.8 

5.0 

1.8 

1.3 

22 

1.9 

2.9 

1.3 

2.8 

2.8 

5.6 

2.9 

3.4 

3.8 

2.9 

2.8 

3.4 

25.5 

5.5 

ZA 

7.9 

1.4 

4.1 

4.2 

3.5 

5.1 

5.8 

13.7 

2.4 



SW1/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 3.2 

Homo sapiens, clone MGC:15961 , mRNA, complete cds 8.7 

Homo sapiens, clone MGC:15961, mRNA, com 2.0 

Homo sapiens cONA: FU22993 lis, done KAT11914 5.5 

stomatin-like2 11.3 

degenerative spermatocyte (homolog Drosophiia; lipid desaturase) 1.7 

ESTs 5.2 

anaphase promoting complex subunit 1 1 (yeast APC1 1 homolog) 2.7 

Homo sapiens cONA FU1 1472 fis, done HEMBA100171 1 5.3 

ubiquitin specific protease 1 2.3 

x 001 protein 1.5 

hypothetical protein FU20039 2.3 

ESTs 1.4 

hypothetical protein MDS025 3.5 
procollagen-proline, 2-oxogiutarate 4-dioxygenase (proline 4-hydroxytase), alpha polypeptide U 6.5 

hypothetical protein FU22418 2.4 

budding uninhibited by benzimidazoies 1 (yeast homolog), beta 2.1 

Homo sapiens DNA binding peptide mRNA, partial cds 3.2 

COP9 complex subunit 7a 5.8 

clathrin-assodated protein AP47 - 1.5 

karyopherin alpha 3 (importin alpha 4) 3.7 

ESTs 1.4 

ESTs 5.8 

ESTs 1.7 

kinesin-iike 1 3.3 

fibroblast activation protein, alpha 14.7 

mitochondrial ribosomal protein 137 5.5 

ESTs 4.4 

synaptosomat-assodated protein, 29kD 2.2 

E2F transcription factor 6 2.2 

K1AA1681 protein 7.8 

KIM0781 protein 1.5 

Homo sapiens cONA: FU21550 fis, clone COL06258 1.3 

132 



WO 02/059377 PCT/US02/02242 



132266 AA301228 Hs,43299 hypothetical protein FU 12890 5.7 

132273 AA227710 Hs.43658 OKFZP586L1 51 protein 4.2 

132276 AA653507 Hs.285711 hypothetical protein RJ13089 2.1 

132288 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member 10 1.5 

5 132294 AB023191 Hs.44131 K1AA0974 protein 10.0 

132298 NM_015986Hs.7120 cytokine receptor-like molecule 9 1.9 

132299 AW405882 Hs.44205 cortistatin 9.2 
132325 N37065 Hs.44856 hypothetical protein RJ 121 16 2.0 
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucteoprotein 04ike 6.5 

10 132370 AW572805 Hs.46645 ESTs 3.8 

132374 AF155582 Hs.46744 corel UDP-^actose:N-acetylgaladosarnIne-alpha-R beta 1,3-galactosyl transferase 1.5 

132376 AI279892 Hs.46801 sorting nexin 14 12.5 

132384 AA312135 Hs.46967 HSPC034 protein 28.3 

132393 AU35094 Hs.47334 hypothetical protein FU 14495 1.9 

15 132450 AA100012 Hs.48827 hypothetical protein FU 12085 1.9 

132452 AW973521 Hs.247324 mitochondrial ribosomal protein S1 4 6.1 

132456 AB0110S4 Hs.48924 KIAA05 12 gene product; ALEX2 1.7 

132465 AW169847 Hs.49169 K1AA1 634 protein 8.6 

132470 AI224456 Hs.4934 H.sapiens poIyA site DNA 5.2 

20 132484 X16660 Hs.1 19007 RAB4, member RAS oncogene family 1.4 

132518 AW885606 Hs.5064 ESTs 6.1 

132528 T78736 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 3.3 

132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiaej-like 1 2.0 

132532 AA454132 Hs.5080 mitochondrial ribosomal protein L1 6 2.9 

25 132534 BE388673 Hs.5086 hypothetical protein MGC1 0433 2.2 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 7.3 

132571 AW674699 Hs.5169 suppressor of G2 allele ofSKPl.S.cerevislae, homolog of 1.7 

132574 AW631437 Hs.5184 TH1 drosophUa homolog 7.1 

132596 AK001484 Hs.5298 CGI45 protein 2.2 

30 132611 AA345547 Hs.53263 hypothetical protein FU 13287 2.2 
132612 H12751 Hs.5327 PR01914 protein ' 6.8 

132616 BE262677 Hs.283558 hypothetical protein PR01 855 14.0 

132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 11.4 

132648 U51127 Hs.54434 hypothetical protein MGC1 71 5 1.9 

35 132668 AB018319 Hs.5460 KIAA0776 protein 2.6 

132692 AW191962 Hs.249239 collagen, type VIII, alpha 2 2.0 

132715 F1 1875 Hs.5534 Homo sapiens cDNA FU12961 fis, clone NT2RP2005645 1.5 

132718 NMJK)4600Hs.554 Sjogren syndrome antgen A2 (60kD, ribonucteoprotein autoantigen SS^A/Ro) 3.0 

132724 AI142265 Hs.55498 geranylgeranyi diphosphate synthase 1 2.4 

40 132731 AI189075 Hs.301872 hypothetical protein MGC4840 114 

132744 AA010233 Hs.55921 glutamyl-prolyWRNA synthetase 14.6 

132760 AA125985 Hs.56145 thymosin, beta, Identified in neuroblastoma cefls 2.7 

132771 Y10275 Hs.56407 phosphoserine phosphatase 3.0 

132773 AA459713 Hs.295901 KIAA0493 protein 2.3 

45 132784 A1142133 Hs^6845 GDP dissociation Inhibitor 2 18 

132798 A1026701 Hs.5716 KIAA0310 gene product 3.7 

132807 U07418 Hs.57301 mul (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) 1.8 

132810 AB007944 Hs.5737 KIAA0475 gene product 5.9 

132813 BE313625 Hs.57435 solute carrier family 11 (proton-coupled divalent metal ion transporters), member 2 8.7 

50 132815 A1815189 Hs.57475 sex comb on midleg homolog 1 6.4 

132817 N27852 Hs.57553 tousled-tike kinase 2 3.6 

132821 AJ251595 Hs.169610 CD44 antigen (homing function and Indian blood group system) 2.8 

132833 U78525 Hs.57783 euteryoiic translation Initiation factor 3, subunit 9 (eta, 11 6kD) 14.6 

132842 NWL016154HS.279771 Homo sapiens done PP1 596 unknown mRNA 1.6 

55 132844 F12200 Hs.5811 chromosome 21 open reading frame 59 2.5 

132851 U09716 Hs.287912 lectin, mannose-binding, 1 1.4 

132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 4.2 

132869 AW963217 Hs.203961 ESTs, Moderately similar to AF1 16721 89 PR02168 {Ksapiens] 2.8 

132873 AW007683 Hs.58598 K1AA1 266 protein 2.0 

60 132875 NAl.004850Hs.5861 7 Rho-assodated, coiled-coil containing protein kinase 2 1.6 

132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary factor 1 (non-standard symbol) 1.4 

132897 AW503667 Hs.59545 ring finger protein 15 5.4 

132902 AI936442 Hs.59838 hypoftetical protein FU10808 6.1 

132912 AW732760 Hs.167578 Homo sapiens cONA FU11095 fis, clone PIACE1005374 7.1 

65 132913 W78714 Hs.60257 Homo sapiens cDNA FU 13598 fe, done PLACE 1009921 2.8 

132940 T79136 Hs.127243 Homo sapiens mRNA for K1AA1724 protein, partial cds 6.1 

132941 AI817165 Hs.6120 hypothetical protein FU13222 10.3 
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132942 AA554458 Hs.197751 K1AA0666 protein 1.8 

132952 A1658560 Hs.61426 Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 2.2 

132962 AA576635 Hs.6153 CGI-48 protein 4.9 

132972 AA034365 Hs.288924 Homo sapiens cONARJ 11 392 fis, clone HEMBA1 000575 2.7 

5 132973 AA035446 Hs.323277 ESTs 5.3 

132977 AA093322 Hs.301404 RNA binding motif protein 3 3.2 

132980 AA040696 Hs.62016 ESTs 1.3 

132994 AA1 12748 Hs.279905 clone HQ0310 PRO0310p1 3.0 . 

133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE:335 1295, mRNA 10.3 

10 133015 AJ002744 Hs.246315 UDP-N-ace!yte!pha-D^alactosamine:polypeptide N-acetylgalactosaminyltransferase 7 (GalNAc-T7) 2.1 

133016 AI439688 Hs.6289 hypothetical protein FU20886 1.3 

133053 AI065016 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, complete cds 6.0 

133062 AW500374 Hs.64056 PRO01 49 protein 5.3 

1 33069 BE247441 Hs.6430 protein with polyglutamlne repeat; calcium (ca2+) homeostasis endoplasmic reticulum protein 4.9 

15 133091 AK001628 Hs.64691 KIAA0483 protein 3.5 

133110 AA808177 Hs.65228 ESTs 13.1 

133134 AF198620 Hs.65648 RNA binding motif protein 8A 1.3 

133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds . 2.2 

133152 Z11695 Hs.324473 mitogen^ctwated protein kinase 1 1.3 

20 133174 AA431620 Hs.324178 hypothetical protein MGC2745 17.1 

133175 AW955632 Hs.66666 ESTs, Weakly similar to S1 9560 proline-rich protein MP4 - mouse [M.musculus] 1.8 

133177 X97795 Hs.66718 RAD54 (S.cerevislae)-like 4.9 

133197 AI275243 Hs.180201 hypometical protein FU20671 3.1 

133208 A1801777 Hs.6774 ESTs 4.4 

25 133226 AW954569 Hs.296287 Homo sapiens, Similar to bromodomain-containing 4, clone IMAGE:3542455, mRNA, partial cds 1 .7 

133228 AI492924 Hs.6831 golgi phosphoproteln 1 6.0 

133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1 1.5 

133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE :3544662, mRNA. partial cds 1.4 

133266 AI160873 Hs.69233 zinc finger protein 5.6 

30 133268 AW956781 Hs.293937 ESTs. Weakly similar to FXD2_HUMAN FORKHEAD BOX PROTEIN 02 [H.sapiensl 1.9 

133285 M76477 Hs.289082 GM2 gangCoside activator protein 4.7 

133291 BE297855 Hs.69855 NRAS^elated gene 5.0 

133314 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A receptor, pi 2.7 

133321 T79526 Hs. 1795 16 integral type I protein 9.3 

35 133327 AL390127 Hs.7104 KruppeWike factor 13 4.4 

133347 BE257758 Hs.71475 acid duster protein 33 1.8 

133360 AI016521 Hs.71816 v-akt murine thymoma viral oncogene homolog 1 5.5 

133366 AA292811 Hs.72050 non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase} .2.7 

133367 AF231919 Hs.18759 KIAA0539 gene product 1.7 
40 133370 AF245505 Hs.72157 DKFZP56411922 protein 1.8 

133383 BE313555 Hs.7252 KIAA 1224 protein 1.7 

133390 AI950382 Hs.72660 phosphatidylserine receptor 1.3 

133391 AW103364 Hs.727 inhlbin, beta A (activln A, activin AB alpha polypeptide) 16.1 
133394 AA305127 Hs.237225 hypothetical protein HT023 12.2 

45 133437 AL031591 Hs.7370 phosphotidyBnosito) transfer protein, beta 10.4 

1 33452 NM_002759Hs.274382 protein kinase. Interferon-inducible double stranded RNA dependent 1 .2 

133453 AI659306 Hs.73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 1.7 
133500 AW964804 Hs.74280 hypothetical protein FU22237 11.1 
133529 W45623 Hs.74571 ADP-ribosylation factor 1 2.8 

50 133540 AL037159 Hs.74619 proteasome (prosome. macropain) 26S subunit, non-ATPase, 2 2.9 

133543 AU077073 Hs.108327 damage-specific DNA binding protein 1 (127kD) 2.5 

133578 AU077O50 Hs.75066 translin 1.5 

133579 X75346 Hs.75074 mitogen-activated protein kinase-activated protein kinase 2 2.1 
133582 BE391579 Hs.75087 Fasnactivated serine/threonine kinase 1.3 

55 133594 AW160781 Hs, 172589 nuclear phosphoproteln similar to S. cerevisiae PWP1 2.2 

133595 AA393273 Hs.75133 transcription factor 6-fike 1 (mitochondrial transcription factor Mike) 1.5 

133599 NNL002885HS.75151 RAP 1 , GTPase activating protein 1 5.7 

133621 NMJXM893HsJ5258 H2A histone family, member Y 25.5 

133627 NM_002047Hs.75280 glycyl-tRNA synthetase 15.8 

60 133631 NM_000401HsJ5334 exostoses (multiple) 2 3.3 

133649 U25849 Hs.75393 acid phosphatase 1, soluble 1.6 

133690 AV661185 Hs.75574 mftochondrial ribosomal protein L1 9 4.1 

133720 L27841 Hs.75737 pencentriolar material 1 1.5 

133722 AW969976 Hs.279009 matrix Gla protein 6.3 

65 133751 AW402048.HS.334787 Homo sapiens, Similar to likely ortholog of yeast ARV1. clone IMAGE:3506392, mRNA 3.9 

133757 T52946 Hs.196209 RAE1 (RNA export 1, S.pombe) homotog 1.7 

133760 BE271766 Hs.181357 iaminm receptor 1 (67kD. ribosomal protein SA) 1.8 
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133765 M62194 Hs.75929 cadherin 1 1, type 2, OB-cadherln (osteobtasQ 1 5 

133780 AA557660 Hs.76152 decorin 35 

133784 BE622743 Hs.301064 arfaptinl 68 

- 133791 M34338 Hs.76244 spermidine synthase 26 

5 133797 AL133921 Hs.76272 reflnoblastoma-binding protein 2 1.4 

133822 D50525 Hs.699 peptidylprolyl Isomerase B (cyciophiiin B) 8 0 

133842 AW797468 Hs.285013 putative human HLA dass II associated protein I 13.5 

133845 AA147028 Hs.76704 ESTs 22 

133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 1 8 

10 133859 U86782 Hs.178761 26S proteasome-associatedpadl homolog 20 

133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 2.8 

133B67 AW340125 Hs.76989 K1AA0097 gene product 6 7 

133868 AB012193 Hs.183874 cutlin 4A 25 

133881 U30872 Hs77204 centromere protein F (35Q/400kD, mitosin) • 3.0 

15 133922 U30825 Hs.77608 splidng factor, arginine/serme-nch 9 14 

133924 D86326 Hs.325948 vesicle docking protein p1 15 5.4 

133929 NMJ06306HS.211602 SMC1 (structural maintenance of <±rorriosomes1 t yeast)-like1 4.9 

133936 L17128 Hs.77719 gamma-glutamyl carboxylase 3.7 

133941 BE244332 Hs.77770 adaptor-related protein complex 3 ( mu 2 subunil 12.1 

20 133959 X81789 Hs.77897 splicing factor 3a, subunit 3. 60kD 9.7 

133976 AI908165 Hs.169946 GATA-binding protein 3 (T-ceB receptor gene activator) 3.1 

133989 AL040328 Hs.78202 SWI/SNF related, matrix associated, actin dependent regulator of chromatin 1.3 

133997 AI824113 Hs.78281 regulator of G-protein signalling 12 9.7 

134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor 24 

25 134015 D31764 Hs.278569 sorting nexin 17 2.5 

134070 NM_003590Hs.78946 culfin 3 1.3 

134110 U41060 Hs.79136 UV-1 protein, estrogen regulated 4.2 

134129 NMJ14742HS.79305 KIAA0255 gene product 2.2 

134134 H86504 Hs.173328 protein phosphatase 2, regulatory subunil B (B56), epsilon isoform 5.0 

30 134200 8E559598 Hs.197803 K1AA0 160 protein 3.2 

134206 AF107463 Hs.79968 splicing factor 30, survival of motor neuron-related 2.5 

134208 NMJ00288Hs.79993 peroxisomal biogenesis factor 7 21 

134219 NR000402Hs.80206 glucose-6-phosphate dehydrogenase 9.1 

134234 BE300078 Hs.80449 Homo sapiens, clone 1MAGE:3535294, mRNA, partial cds 2.8 

35 134275 AI878910 Hs.3688 cisplatin resistance-associated overexpressed protein 1.8 

134292 AI906291 Hs.81234 immunoglobulin superfantiy, member 3 2.0 

134301 AW502505 Hs.81360 Homo sapiens cDNA: FU21927 (is, done HEP04178, highly similar to HSU90909 2.5 

134305 U61397 Hs,81424 ubiquitin-like 1 (sentrin) 28 

134324 AB029023 Hs.179946 KIAA1 100 protein 10.4 

40 134326 AW903838 Hs.81800 chondral sulfate proteoglycan 2 (versican) 1.9 

134329 N92036 Hs.81848 RAD21 {S. pombe) homolog 2.6 

134337 NMJ04922HS.81964 SEC24 (S. cerevisiae) related gene family, member C 2.3 

134348 AW291946 Hs.82065 interieukin 6 signal transducer (gp130, oncostatin M receptor) 13.0 

134367 AA339449 Hs.82285 phosphoribosylglycinamide formyltransferase, phosphoribosylglycinamide synthetase, 8.8 

45 134376 X06560 Hs.82396 2',5 , ^oadenylate synthetase 1 (4046 kD) 1.5 

134379 AW362124 Hs.323193 hypothetical protein MGC3222 8.1 
134384 AI589941 Hs.8254 Homo sapiens, Similar to tumor dinerenfially expressed 1, done IMAGE;3639252, mRNA, partial cds 26 

134391 AA417383 Hs.82582 Integrin, beta-like 1 (with EGF-lDce repeat domains) 4.1 

134395 M456539 Hs.8262 lysosomal 17 

50 134403 AA334551 Hs.82767 sperm specific antigen 2 2.6 

134405 AW0679Q3 Hs.82772 collagen, type XI, alpha 1 1.3 

134411 BE272095 Hs.167791 reticulocaibin 1, EF-hand calcium binding domain 3.2 

134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 1.9 

134421 AU077196 Hs.82985 collagen, type V, alpha 2 10.3 

55 134424 Z44190 Hs.83023 peroxisomal biogenesis factor 1 1B 24 

134445 AA1 12036 Hs.83419 KIAA0252 protein 1.2 

134447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer In B-cells 1 (p1 05) 1.6 

134470 X54942 Hs.83758 CDC28 protein kinase 2 21 

134480 NM_005000Hs.83916 Empirically selected from AFFX single probeset 5.3 

60 134485 X82153 Hs.83942 cathepsin K (pycnodysostosis} 25 

134498 AW246273 Hs.84131 threonyl-tRNA synthetase 21 

134513 AA425473 Hs.84429 WAA0971 protein 38 

134516 AK001571 Hs.273357 hypothetical protein FU 10709 24 

134520 BE091005 Hs.74861 activated RNA polymerase II transcription cofactor 4 6.7 

65 134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 23 

134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor for IRNAs) 5.5 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8 
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134612 
134624 
134632 
134654 
134664 
134666 
134687 
134692 
134705 
134714 
134719 
134722 
134724 
134746 
134751 
134790 
134806 
134834 
134850 
134853 
134859 
134880 
134910 
134925 
134955 
134971 
134975 
135011 
135022 
135032 
135077 
135083 
135095 
135096 
135153 
135181 
135199 
135207 
135214 
135243 
135245 
135257 
135263 
135274 
135294 
135295 
135307 
135321 
135354 
135361 
135389 
135400 
134975 
• 135011 
135022 
135032 
135077 
135083 
135095 
135096 
135153 
135181 
135199 
135207 
135214 
135243 
135245 



AW068223 Hs.171581 
AF035119 Hs.8700 
X78520 Hs.174139 
AK001741 Hs.8739 
AA256106 Hs.87507 
BE391929 Hs.8752 
U62317 Hs.88251 
NM.003474Hs.8850 
BE161887 Hs.88799 
Y14768 Hs.890 
AA852985 Hs.89232 
AF129536 Hs.284226 
AF045239 Hs.321576 
X07871 Hs.89476 
AW630803 Hs.89497 
BE002798 Hs.287850 
AD001528 Hs.89718 
AW451370 Hs.8991 
AI701162 Hs.90207 
BE268326 Hs.90280 
026488 Hs.90315 
AI879195 HS.90606 
AA532963 Hs.9100 
AW885909 Hs.6975 
AW401361 Hs.91773 
AI097346 Hs.286049 
R50333 Hs.92186 
AB037835 Hs.92991 
NM_000408Hs.93201 
AW301984 Hs.173685 
AW503733 Hs.9414 
AB036063 Hs.94262 
AF027219 Hs.9443 
AA081258 Hs.132390 
AI093155 Hs.95420 
BE250865 Hs.279529 
AA477514 Hs.96247 
N26427 Hs.9634 
T78802 Hs.96560 
BE463721 Hs.97101 
AI028767 Hs.262603 
AW291023 Hs.97255 
AI088775 Hs.55498 
AA448460 Hs.1 12017 
AA150320 Hs.9800 
AI090838 Hs.98006 
AI743770 Hs.98368 
AI652069 Hs.98614 
AA456454 Hs.183418 
AA373452 Hs.167700 
U05237 Hs.99872 
X78592 Hs.99915 
R50333 Hs.92186 
AB037835 Hs.92991 
NMJ30Q408HS.93201 
AW301984 Hs.173685 
AW503733 Hs.9414 
AB036063 Hs.94262 
AF027219 Hs.9443 
AA081258 Hs.132390 
AI093155 Hs.95420 
BE250865 Hs.279529 
AA477514 Hs.96247 



ubiquhin C-terminal hydrolase UCH37 
deleted In fiver cancer 1 
chloride channel 3 
hypothetical protein FU10879 
ESTs • 

transmembrane protein 4 
arylsulfataseA 

a disintegrin and metalloproteinase domain 12 (meltrin alpha) 

anaphase-promoting complex subunit 10 

lysosomal 

chromobox homolog 5 (Drosophila HP1 alpha) 
F-boxonly protein 6 
ring finger protein 22 

C02 antigen (p50), sheep red blood cell receptor 
laminBI 

integral membrane protein 1 
spermine synthase 

adaptor-related protein complex 1, gamma 2 subunit 
hypothetical protein MGC1 1 138 

5-amlnoB7ildazote-4-carboxamide ribonucleotide formyllransferase/lMP cyclohydrolase 
KIAA0007 protein 
15 kDa selenoprotein 

Homo sapiens cONA FU13100 fis, clone NT2RP3002255 
PRO1073 protein 

protein phosphatase 2 (formerly 2A), catalytic subunit, alpha Isoform 
phosphoserine aminotransferase 
Leman colled-coll protein 
K1AA1414 protein 

glycerol-3-phosphate dehydrogenase 2 (mitochondrial) 
hypothetical protein FU12619 
K1AA1488 protein 

p53-inducible ribonucleotide reductase small subunit 2 homolog 

zinc finger protein 202 

zinc finger protein 36 (KOX 18) 

JM27 protein 

px19-like protein 

translin-associated factor X 

ESTs, Highly similar to C10_HUMAN PUTATIVE C10 PROTEIN [H.sapiens] 
hypothetical protein FU1 1 656 
putative G protein-coupled receptor 
ESTs 

ESTs, Weakly similar to A46010 X-Iinked retinopathy protein [H.sapiens] 

geranylgeranyl diphosphate synthase 1 

GE36gene 

protein kinase Njmu-R1 
ESTs 

ESTs, Weakly similar to KIAA0822 protein [H.sapiens] 
ribosome binding protein 1 (dog 180kD homolog) 
cell division cycle 2-0ke 1 (PITSLRE proteins) 
Homo sapiens cONA FU10174 fis, clone HEMBA1003959 



N26427 
T78802 



Hs.9634 
Hs.96560 



BE463721 Hs.97101 
A1028767 Hs.262603 



androgen receptor (dihydrotestosterone receptor, testicular feminization; spinal and bulbar 
Leman colled-cofl protein 
KIAA1414 protein 

glycerol-3-phosphate dehydrogenase 2 (mi 
hypothetical protein FU12619 
KIAA1488 protein 

p53-fndudble ribonucleotide reductase s 

zinc finger protein 202 

zinc finger protein 36 (KOX 18) 

JM27 protein 

px19-Bke protein 

transEn-associated factor X 

ESTs, Highly similar to C10JIUMAN PUTATI 

hypothetical protein FU11656 

putative G protein-coupled receptor 

ESTs 
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6.1 
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1.5 

2.1 

4.4 

14.9 

1.3 

1.7 

6.1 

2.7 
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135257 AW291023 Hs.97255 ESTs, Weakly similar to A460 10 X-linked 7.6 

135263 AI088775 Hs.55498 geranylgefany! diphosphate synthase 1 1.8 

135274 AA448460 Hs.1 12017 GE36gene 41 

135294 AA150320 Hs.9800 protein kinase Njmu-R1 12 

5 135295 AI090838 Hs.98006 ESTs 4 8 

135307 AI743770 Hs.98368 ESTs, Weakly similar to WAA0822 protein 5.8 

135321 A1652069 Hs.98614 ribosome binding protein 1 (dog 180kO ho 12.3 

135354 AA456454 Hs.183418 cell division cycle 2-fike 1 (PITSLRE pr 5.7 

135361 AA373452 Hs.167700 Homo sapiens cDNA FU10174 fis, clone HE 7.9 

10 135389 U05237 Hs.99872 fetal Alzheimer antigen 1.9 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9 

302256 AA857131 Hs.171595 HIV TAT specific factor 1 5.3 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-02, 2 2 

303135 AW592789 Hs.279474 HSPC070 protein 1 4 

15 303686 AK000714 Hs.109441 MSTP033 protein 5.2 

310085 R43191 Hs.101248 Horro sapiens clone IMAGE^2553, mRNA seq 13 

315518 AA808229 Hs.167771 ESTs 2.8 

317781 NM_007057Hs.42650 ZWIOinteractor 2.0 

320836 AI268997 Hs. 197289 rab3 GTPase-acfivating protein, non-cata 5.5 

20 321114 AA902256 Hs.78979 Golgi apparatus protein 1 1.4 

322221 N24236 Hs.179662 nudeosome assembly protein 1-like 1 1.3 

322474 AF1 18083 Hs.29494 PR01912 protein 2.9 

322556 BE041451 Hs.177507 hypothetical protein 1.6 

323541 AF292100 Hs.104613 RP42homolog 1.8 

25 407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimidazo 1.6 

408196 AL034548 Hs,43627 SRY (sex determining region Y)-box 22 6.1 

408813 AI580090 Hs.48295 RNA heficase family 5.6 

409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti 2.6 

413670 AB0Q0115 Hs.75470 hypoftetlcal protein, expressed in osteo 2.4 

30 414108 A1267592 Hs.75761 SFRS protein kinase 1 1.5 

414846 AW304454 Hs.77495 UBX domain-containing 1 4.2 

416980 AA381133 Hs.80684 higrwnobility group (nonhistone chromoso 23.6 

417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate 5.8 

418283 S79895 Hs.83942 calhepsin K (pycnodysostosis) 1.3 

.35 418467 NMJ)05910Hs.85273 retinoblastoma-binding protein 6 1.6 

420269 U72937 Hs.96264 alpha thalassemia/mental retardation syn 2.3 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 1.6 

421225 AA463798 Hs.102696 MCT-1 protein 3.5 

421642 AF1 72066 Hs. 106346 retinoic acid repressible protein 4.9 

40 421828 AW891965 Hs.279789 histone deacetylase 3 3.1 

421983 AI252640 Hs.1 10364 peptidytprolyi isomerase C (cydophilin 1.9 

422052 AA302744 Hs.104518 ESTs 2.4 

422055 NM_014320Hs.1 11029 putative heme-binding protein 4.1 

423750 AF165883 Hs.298229 prefoidin2 7.0 

45 424001 W67883 Hs. 137476 paternally expressed 10 (PEG 10; KIAA 105 4.9 

425182 AF041259 Hs.155040 zinc finger protein 217 3.4 

425284 AF155568 Hs.155439 NS1 -associated protein 1 2.1 

426372 BE304680 Hs.169531 DEAD/H (Asp-GIu-Ala-Asp/His) box polypep 7.5 

428049 AW183765 Hs.182238 GW128 protein 1.7 

50 428477 AW500533 Hs.1 1482 spiicir^fador.argintne/serine-richll 2.4 

437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ate-Asp/His) box polypep 3.8 

438449 AK001333 Hs.6216 Homo sapiens hepatocellular caroinoma-as 5.6 

441560 F13386 Hs.7888 Homo sapiens clone 23736 mRNA sequence 2.0 

445580 AF167572 Hs.12912 skbl (S. pombe) homotog 7.5 

55 446999 AA151520 Hs.334822 hypothetical protein MGC4485 2.2 

447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinaO 2.8 

447778 BE620592 Hs.71190 ESTs, Weakly similar to S16506 hypotheti 1.7 

448873 NMJ03677Hs.22393 density-regulated protein 5.9 

449687 W68520 Hs.331328 Intermediate fflament protein syncoilin 5.6 

60 450701 H39960 Hs.288467 Homo sapiens cONAFU 12280 fis. clone MA 1.4 

450703 AA011202 Hs.184771 niictearta<torl/C(C(>VAT-bincfingtransc 4.7 

452461 N78223 Hs.108106 transcription factor 2.9 

452511 BE408178 Hs.285165 Homo sapiens cDNA FLJ20845 Hs, clone AO 12.1 

453157 AF077036 Hs.31989 DKF2P586G 1722 protein 4.7 

65 453658 BE541906 Hs.87819 Homo sapiens, done MGC'2492, mRNA, comp 1.3 

100833 AF135168 Hs.108802 N-em>tofesnide-sensitive factor 3.2 

102481 U50360 gb:Human calcium, calmodulin-dependent p 6.2 
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102827 BE244588 Hs.6456 chaperonln containing TCPl.subunlt 2 (b 7.9 

103549 BE270465 Hs.78793 protein kinase C, zeta 2.0 

104331 AB040450 Hs.279862 cdk Inhibitor p21 binding protein 5.3 

110018 AW579842 Hs.104557 hypothetical protein FU 10697 2.0 

5 115008 AKQ01827 Hs.87889 helicase-moi 5.7 

119075 M10905 Hs.287820 fibronectin 1 1.3 

119615 AL034423 Hs.75875 ubiquitin-conjugating enzyme E2 variant 2.9 

125006 BE065136 Hs.145696 splicing factor (CC1. 3) 1.7 

127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 2.4 

10 129209 R62676 Hs.17820 Rruwssociated, coHed-coil containing p 5.2 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5 

130182 BE267033 Hs. 192853 ubiquitin-conjugating enzyme E2G 2 (homo 11.0 

130365 W56119 Hs. 155 103 eukaryotic translation initiation factor . 3.3 

131135 NfvL016569Hs.267182 TBX3-iso protein , 1-3 

15 131853 A1681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 3.2 

131881 AW361 018 Hs.3383 upslream regulatory element binding prot 14.3 

132726 N52298 Hs.55608 hypothetical protein MGC955 3.0 

135193 X95525 Hs.96103 TATA box binding protein (TBPJ-assodate 2.7 

409487 H19886 gb:yn57a05.r1 Soares adult brain N2b5H 2,3 

20 416040 AW819158 Hs.289044 Homo sapiens cDNA FU 12048 fis, clone HE 7.4 
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TABLE 4A 



Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



123615 
123619 
101445 
124385 
124417 
124482 
102481 
103349 
110856 



120280 
113248 
120472 



30686_-15 

371681J 

1650_^ 

656394J 

1642364J 

1657509J 

31281_-28 

11052 -2 

19346J4 . 



160212 1 
328626J 
44573__2 



129019 44573,2 



120695 9683_3 



122188 
121581 
122618 
109026 
123658 
123811 
125115 
125147 
118737 
120274 
113196 
120504 
120639 



275673J 
283769 1 
305217 1 
150431 1 



AA60917O 

AA602964AA609200 
M21259 

AI267847 N27351 
N34059 N46979 
N53935N53950 
U50360 
X89059 

AA992380 N33063 N21418 H79958 R21911 H79957 

103797 109699J AA0B0912AA075318 AA083403 AA076594 AA078992 AA084926 AA081881 AA1 13913 AA 113892 
M083821 AA134801 AA082953 AA070343 AA062835 AA075419 AA063293 AA071252 AA078900 AA062836 AW974305 

AA190577AA181657 

T63857 AW971220 AA493469 T63699 

AJ950087 N70208 R97040 N36809 AJ3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI21 9788 AA88UM N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 A1872193 AI763273 AW1 73586 AW150329 A1653832 
AI762688 AA988777 AA488892 AJ356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538AA180009 AA337499 AW961101 AA251669 AA251874 
AI819225 AW205862 AI683338 A1858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
A1950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 A1560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA263144 AI890387 AI950344 A1741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW1 73586 AW150329 AI653832 
A1762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538AA180009 AA337499 AW961101 AA251669AA251874 
A1819225 AW205862 A1683338 AI858509 AW276905 A1633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AJ022058 M780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW1 94691 AI927301 AI740458 A1796100 AI935603 AWQ52210 
AA970201 AI633384 AA425910 AI0170O4 A1241295 AA402816 AA291468 
AA398838AA435847 

AA416568 AA442689 AA417233 AA442223 
AA453641 AA454061 



AA157811AA836869 
genbanKJ\A609364 AA609384 
genbank^AA6205B6 AA620586 
genbankJT97341 T97341 
NOTj : OUND_entre^.W38150 
382979 J AA199686 N73861 
genbankJ\A177051 AA177051 
genbank.T57317 T57317 
genbanleAA256837 AA256837 
genban!eAA286942 AA286942 



W38150 
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120809 
113702 



genbankJ\A346495 
genbankj"97307 



AA346495 
T97307 



5 

10 
15 
20 



129680 23162J U03749 NMJJ01275 J03483 J03915 A1214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122 
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D531 19 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 AA405741 AI091 983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 AA782478 AA910064 AI22Q384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 A1686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441 
AJ803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 M889304 AA976699 A16B7001 
AI621 107 AI865540 AA772107 C06286 AA319661 AA405992 



101045 


entre*J05614J05614 


117247 


genbank 


N21032 


110501 


genbank 


H55748 


103392 


entre2LX94563X94563 


105032 


genbank 


AA127818 


119513 


NOT_FOUND_entrez W37933 


105445 


genbank 


AA252395 


121514 


genbank 


AA412112 


121558 


genbank 


AA412497 


121911 


genbank 


AA427950 


123315 


714071J 


AA498369AA496646 


114911 


genbank 


AA236672 


409487 


1134778 J 


H19886AW402806T10231 
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TABLE 5: Figure 5 from BRCA 001 US 
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD: Unigene number 

1 0 Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 



15 



20 



25 



30 



35 



40 



45 



50 



55 
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Pkey ExAccn UnigeneiD UnigeneTitle 



100114 
10)147 
100154 
100335 
100566 
100667 
100668 
100678 
100988 
101031 
101045 
101332 
101352 
101580 
101592 
101767 
101806 
101810 
101983 
102107 
102165 
102198 
102217 
102220 
102302 
102348 
102374 
102455 
102568 
102618 
102687 



102704 
102705 
102801 
102827 
103060 
103080 
103178 
103206 
103238 
103547 
103549 
103886 
104325 
104827 
104846 
104S54 
104867 



X02308 Hs.82962 
013666 Hs.136348 
H60720 Hs.81892 
AW247529 Hs.6793 
U05424 Hs.169610 
L05424 Hs.169610 
L05424 Hs.169610 
AW502935 HsJ40 
AK000405 Hs76480 
J05070 Hs.151738 
J05614 

J04088 Hs.156346 
AM94299 Hs.16297 
NMJ)12151Hs.83363 
AF064853 Hs.91299 
M81057 Hs.180884 
AA586894 Hs.1 12408 
NM_000318Hs.180612 
AI904232 Hs.75323 
BE258602 Hs.182366 
BE313280 Hs.159627 
AW950852 Hs.74598 
AA829978 Hs.301613 
U24389 Hs.65436 
AA306342 Hs.69171 
U37519 Hs.87539 
U33635 Hs.90572 
U48705 Hs.75562 
W81489 Hs.223025 
AL037672 Hs.81071 
NM_007019Hs.93002 
U96132 Hs.171280 
AU077058 Hs.54089 
T97490 Hs.50002 
BE252241 Hs.38041 
BE244588 Hs.6456 
NM_005940Hs.155324 
AU077231 Hs.82932 
AA205475 Hs,275865 
X72755 Hs.77367 
AI369285 Hs.75189 
AI376722 Hs.180062 
BE270465 Hs.78793 
AK001278 Hs.105737 
BE379766 Hs.150675 
AW052006 Hs.8551 
A1250789 Hs.32478 
AA041276 Hs.154729 
AA278898 Hs.225979 



thymidyfate synthetase 
osteoblast specific factor 2 (fasdcUn 
KIAA0101 gene product 
platelet-acth/allng factor acetythydrota 
C044 antigen (homing function and Indian 
CD44 antigen (homing function and Indian 
CD44 antigen (homing function and Indian 
PTK2 protein tyrosine kinase 2 
ubiquitin-like 4 

matrix metattoproteinase 9 (getatinase B 
gb:Human proliferating cell nuclear anti 
topoisomerase (DNA) II alpha (170kO) 
COX17 (yeast) homolog, cytochrome c oxid 
coagulation factor Vlll-associated (intr 
guanine nucleotide binding protein ( 
carboxypeptldase B1 (tissue) 
S100 calcium-binding protein A7 (psorias 
peroxisomal membrane protein 3 (35kD, Ze 
prohibits 

heat shock protein 75 
death associated protein 3 
polymerase (DNA directed), delta 2, regu 
JTV1 gene 



protein kinase C-fike 2 

aldehyde dehydrogenase 3 family, member 

P7K7 protein tyrosine kinase 7 

discoldin domain receptor family, member 

RAB31, member RAS oncogene family 

extracellular matrix protein 1 

ubiqurtin carrier protein E2-C 

hydroxyacyt-Coenzyme A dehydrogenase, ty 

BRCA1 associated RING domain 1 

small inducible cytokine subfamily A (Cy 

pyridoxal (pyridoxine, vitamin B6) kinas 

chaperonin containing TCP1, subunit 2 (b 

matrix metalloprateinase 1 1 (MMP11; stro 

cydin D1 (PRAD1: parathyroid adenomatos 

ribosomal protein S18 

monokine induced by gamma Interferon 

death-associated protein 

proteasome (prosome, macropain) subunit, 

protein kinase C, zeta 

hypothetical protein FU10416 similar to 

polymerase (RNA) II (DNA directed) polyp 

PRP4ySTK/WD splicing factor 

ESTs 

3i3hospholnositide dependent protein kin 
hypothetical protein similar to small G 



R1 

2.9 
7.5 
9.2 
2.7 
5.7 
9 

7.6 

53.2 

11.4 

8.2 

5 

3.4 

6.3 

5.7 

5.6 

14.4 

8.9 

3.2 

8.4 

1.4 

4.6 

4.3 

6.7 

4.3 

2.7 

2 

6.2 
6.9 
5.3 
5.8 
4.3 
6 

1.9 

2.3 

6.4 

5.6 

4.5 

3.1 

9.9 

8.8 

5.6 

9.7 

7.9 

6.5 

6.3 

10.9 

5.6 

12.3 

2 
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04896 AW015318 Hs.23165 
04909 AW408164 Hs.249184 
04916 AW958157 Hs.155489 
04919 M026880 Hs.25252 
04974 Y12059 Hs.278675 
04978 AI199268 Hs.19322 
05012 AF098158 Hs.9329 
05039 AA907305 Hs.36475 
05079 AA151342 Hs.12677 
I05088 H58589 Hs.35156 
05393 AF167570 Hs.256583 
05547 AA262640 Hs.27445 
05564 BE616694 Hs.288042 
I05658 AA985190 Hs.246875 
05746 AW151952 Hs.46679 
I05858 AF151066 Hs.281428 
105930 AF016371 Hs.9880 
06094 AA533491 Hs.23317 
06350 AK001404 Hs.194698 
06359 AW390282 Hs.31130 
06610 AA458882 Hs.79732 
06624 NM_003595Hs.26350 
06713 BE614802 Hs.184352 
I06829 AW959893 Hs.27099 
06846 AB037744 Hs.34892 
06873 N49809 Hs.11197 
06973 BE156256 Hs.11923 
07029 AF264750 Hs.2BB971 
07197 W15477 Hs.64639 
i07859 AW732573 Hs.47584 
107901 L42612 Hs.335952 
07922 BE153855 Hs.61460 
S07974 AW956103 Hs.61712 
08274 AF129535 Hs.272027 
08647 BE546947 Hs.44276 
08695 AB029000 Hs.70823 
08894 AK001431 Hs.5105 
109011 AA156542 Hs.72127 
09068 AA164293 Hs.72545 
I09273 AA375752 Hs.82719 
I09468 NM_015310Hs.6763 
10240 AI668594 Hs.176588 
10330 AI288666 Hs.16621 
10501 H55748 
10742 AW190338 Hs.28029 
10762 BE044245 Hs.30011 
10856 AA992380 
10958 NM_005864Hs.24587 
11125 N63823 Hs.269115 
11179 AK000136 Hs.10760 
11239 N90956 Hs.17230 
11285 AA778711 Hs.4310 
11392 W46342 Hs.325081 
11937 BE298665 Hs.14846 
12244 AB029000 Hs.70823 
12995 AA737033 Hs.7155 
13777 BE266947 Hs.10590 
13791 AI269096 Hs.135578 
13811 BE207480 Hs.6994 
13834 T26483 Hs.6059 
13868 W57902 Hs.90744 
13870 AL079314 Hs,16537 
13923 AW953484 Hs.3849 
14275 AW515443 Hs.306117 
14895 AA236177 Hs.76591 
14965 AI733881 Hs.72472 
15061 AI751438 Hs.41271 



ESTs 

transcription factor 19 (SC1) 
NS1 -associated protein 1 
prolactin receptor 
bromodomain-containing 4 



177 
5 

1.7 
1.4 
1.4 



Homo sapiens, Similar to RIKEN cDNA 2010 7.2 

chromosome 20 open reading frame 1 3.3 

ESTs 2.5 

CGI-147 protein 9.5 
Homo sapiens cDNA RJ1 1027 fis, clone PL 2.2 

interleukin enhancer binding factor 3, 9 5.4 

unknown 9.3 

hypothetical protein FU14299 1 .4 

hypothetical protein FU2QQ59 9.4 

hypothetical protein FU20739 1 .5 

hypothetical protein 2.9 

peptidyt prolyl Isomerase H (cyctophilin 5.2 

hypothetical protein FU14681 6.8 

cycfin B2 5.7 

transmembrane 7 superfamiiy member 2 6.3 

fibuRn 1 7.9 

tyrosylprotein sulfotransferase 2 7.7 

hypothetical protein FU12549 4.5 

hypothetical protein FU23293 similar to 1 6.2 

KIAA1323 protein 2.2 
Homo sapiens, clone IMAGE33431 49, mRNA, 1 6.8 



hypothetical protein 
myeloid/lymphoid or mixed-lineage leukem 
glioma pathogenesis-related protein 



6.6 
1.8 
6.1 
8.4 
2.5 
2.2 
6.7 
7.1 
9.8 
72 
4 

1.4 
2.9 



keratin 6B 

tg superfamiiy receptor LNIR 
pyruvate dehydrogenase kinase, isoenzyme 
F-boxonly proteins 
homeo box C10 
KIAA1077 protein 
hypothetical protein FU10569 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9 

KIAA0942 protein . 3.2 
ESTs. Weakly similar to CP4YJWMAN CYTOC 4.2 

DKFZP434I116 protein 62 

gb:yq94a01 ,s1 Soares fetal liver spleen 6.1 

hypothetical protein MGC1 1256 7.6 

hypothetical protein MGC2963 9.3 

gb:ot37g06.s1 SoaresJestfeJIHT Homo sap 2.3 

signal transduction protein (SH3 contain 67 

ESTs. Moderately similar to Z195JWMANZ 3.6 

asporin (LRR class 1) 7.1 

hypothetical protein FU22087 7.9 

eukaryotic translation initiation factor 6.9 
Homo sapiens, done IMAGE:3659680, mRNA, 8.4 
Homo sapiens mRNA; cDNA OKFZp564D016 (fr 10.6 

KIAA1077 protein 14.6 

ESTs, Moderately similarto 2115357A TYK 5.6 

zinc finger protein 313 13.4 

chitobiase, di-N-acetyl- 1-3 

Homo sapiens cDNA: FU22044 fis, done H 3.1 

EGF-containlng fibutin-like extracellula 1 1 .3 

proteasome (prosome, macropaln) 26S subu 2.7 

hypothetical protein, similar to (U06944 6.1 

hypothetical protein FU22041 similar to 1 .9 

KIAA0306 protein 15.8 

K1AA0887 protein 7.1 

BMP-R1B 2.3 

Homo sapiens mRNA full length insert cDN 1 1.8 
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115278 AK002163 Hs.301724 hypothetical protein FU 11301 1-5 

115291 BE545072 Hs.122579 hypothetical protein FU 10461 6.2 

115652 BE093589 Hs.38178 hypothetical protein FU23468 10.6 

115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type fece 6.8 

5 115941 AI867451 Hs.46679 hypothetical protein FU20739 5.5 

115968 AB037753 Hs.62767 KIAA1 332 protein 9.8 

116011 AL359053 Hs.57664 Homo sapiens mRNA fuD length insert cON 2.4 

116417 AW499664 Hs.12484 Human clone 23826 mRNA sequence 7.4 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 2.1 

10 116637 AK001043 Hs.92033 mtegrin^nked kinase-associated serine 2.7 

117132 A1393666 Hs.42315 poinding protein 5.2 

117881 AF161470 Hs.260622 butyrate-induced transcript 1 5.7 

118528 AI949952 Hs.49397 ESTs 7.4 

119075 M10905 Hs.287820 fibronectinl 5.7 

15 119265 BE539706 Hs.285363 ESTs 1.4 

119349 T65004 Hs.163561 ESTs 8.4 

119403 AL1 17554 Hs.1 19908 nudeolarprotein NOP5/NOP58 6.7 

119789 BE393948 Hs.50915 kallikreln 5 (KLK5; KLK-L2; stratum com 9.2 

120208 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA45.7 

20 120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120297 AA191384 Hs.104072 ESTs, Weakly simitar to Z195JHUMAN ZINC 15.2 

120325 AA195651 Hs.104106 ESTs 6.4 

120327 AKD00292 Hs.278732 hypothetical protein FU20285 16.1 

120349 AW969481 Hs.55189 hypomefcal protein 16.8 

25 120356 AF000545 Hs.296433 putative purinergic receptor 28.1 

120371 AA219305 Hs.104196 EST 12.4 

120383 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homolo 9.7 

120386 AW969665 Hs.154848 hypothetical protein OKFZp434D01 27 32.6 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN A 21.7 

30 120396 AA134006 Hs.79306 eukaryotic translation Initiation factor 12.5 

120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNAOKFZp586F1323 (f 11.4 

120472 AI950087 gb:wq05c02.x1 MCI_CGAP_Kid12 Homo sapien 19.4 

120484 AA253170 Hs.96473 EST 10.4 

120570 AA280679 Hs.271445 ESTs, Weakly similar to ALU1JHUMAN ALU S 14.4 

35 120582 BE244830 Hs.284228 ZNF135-fike protein 10.2 

120596 AA282074 Hs.237323 N-acetytgkjeosamine-phosphate mutase 7.5 

120624 AW407987 Hs.173518 Mfhase phosphoprotein homolog 52 

120695 AA976503 gb:oq30a04.s1 NCLCGAPJ3C4 Homo sapiens 46.8 

120713 AW449855 Hs.96557 Homo sapiens cONA FU 12727 fis, done NT 5.9 

40 120750 A1191410 Hs.96693 ESTs. Moderately similar to 21 09260A B c 7 

120774 AI60B909 Hs.193985 ESTs 7.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 6.8 

120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens 4.4 

120984 BE262951 Hs.99052 ESTs 5.6 

45 121081 AA398721 Hs.186749 ESTs, Highly similar to I37550 mismatch 5.4 

121408 AA406137 Hs.98019 EST 6 

121505 AA494172 Hs.194417 ESTs 13.1 

121508 AA402515 Hs.97887 ESTs 28 

121513 AA416653 Hs.181510 ESTs 6.2 

50 121549 AA412477 Hs.98142 EST 7.4 

121558 AA412497 gb:zt95g12.s1 SoaresJestis_NHT Homo sap 2.8 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1 023 (f 7.8 

121744 AA398784 Hs.97514 ESTs 7.1 

121748 BE536911 Hs.234545 hypothetical protein NUF2R 19.5 

55 121773 AB033022 Hs.158654 KIAA1 196 protein 7.9 

121832 AW340797 Hs.98434 ESTs 5.8 

121839 AA425691 Hs.191606 ESTs. Highly similar to K1AA1 048 protein 5 

121882 AA426376 Hs.98459 ESTs 5 

121911 AA427950 gb:rw50fl)2^1 Soares totalJetus_Nb2HF8_ 7.2 

60 121999 AA430211 Hs.98668 EST 6.4 

122013 AA431085 Hs.98706 ESTs 6.5 

122036 W92142 Hs.271963 ESTs, Weakly similar to ALU5.HUMAN ALU S 13.1 

122356 AA443794 Hs.98390 ESTs 7.3 

122371 AA868555 Hs.178222 ESTs 5 

65 122372 AA446008 Hs.336677 EST 7.6 

122460 AW418768 Hs.99148 ESTs. Weakly similar to S43569 R01H10.6 9.7 

122490 AA448349 Hs.238151 EST 6.1 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



122492 
122510 
122530 
122572 
122607 
122614 
122616 
122618 
122622 
122717 
122829 
122838 
122856 
122868 
122907 
123016 
123034 
123136 
123152 
123394 
123466 
123486 
123615 
123735 
123753 
124006 
124385 
124440 
124656 
124683 
124735 
124761 
124768 
124788 
124811 
124812 
124822 
124860 
124903 
124930 
124942 
125051 
125056 
125101 
125115 
125280 
127274 
128528 
128670 
128691 
128772 
128781 
128797 
128868 
128891 
128946 
128975 
128995 
129019 
129076 
129088 
129096 
129198 
129347 
129362 
129372 
129404 



AA448417 Hs.104990 

AA449232 Hs.99195 

AW959741 Hs.40368 

AA452601 Hs.99287 

M453518 Hs.98023 

M453630 Hs.99339 

AA453638 Hs.161873 
AA453641 

AA453987 Hs.144802 

AA456859 Hs.178358 

AW204530 Hs.99500 

AA460584 Hs.334386 

AI929374 Hs.75367 

AF005216 Hs.1 15541 

AA470074 Hs.169896 

AW338067 Hs.323231 

AL359571 Hs.44054 

AW451999 Hs.194024 

AW601773 Hs.270259 

AA731404 Hs.105510 

AA599042 Hs.1 12503 

BEO19072 Hs.334802 
AA609170 

NM_013241Hs.9523t 

AA609955 Hs.234961 

AI147155 Hs.270016 
A1267847 

AA532519 Hs.129043 

AW297702 Hs.102915 

AA381661 Hs.1 19878 

R22952 Hs.268685 

AA374756 Hs.93560 

AW368528 Hs.100855 

R43543 Hs.100912 

R46068 Hs.288912 

R47948 Hs.188732 

AA418160 Hs.86043 

R65763 Hs.101477 

AW296713 Hs.221441 

AI076343 Hs.173939 



R99978 

T79956 

T81310 

AI472068 

T97341 

A1123705 



Hs.268892 
Hs.100588 
Hs.100592 
Hs.286236 

Hs.106932 
AW966158 Hs.58582 
R39234 Hs.251699 
AA975486 Hs.103441 
W27939 Hs.103834 
BE302796 Hs.105097 
N71826 Hs.105465 
NMJJ02975H3.105927 
AA419008 Hs.106730 
F34856 Hs.292457 
Y13153 Hs.107318 
BE560779 Hs.284233 
AI816224 Hs.107747 
A1950087 

AW296806 Hs.326234 
AA744610 Hs.194431 
AA463189 Hs.288906 
N57532 Hs.109315 
BE614192 Hs.279869 
U30246 Hs.1 10736 
NM_016Q39Hs.1 10603 
A1267700 Hs.317584 



ESTs 5.4 

ESTs 112 

adaptor-related protein complex 1 , sigma 1 0.1 

EST 11 

ESTs 61.5 

EST 10.7 

ESTs 107.3 
gb:zx48e06.s1 SoaresJestisJJHT Homo sap 311 

ESTs 5.6 

ESTs 8.5 

ESTs 818 

ESTs 75.3 

Src-Iike-adapter 5.8 

Janus kinase 2 (a protein tyrosine Jonas 5.3 

ESTs 11.5 

Homo sapiens cDNA FU11946 fis, done HE 2.8 

nlnein {GSK3B Interacting protein) 8.7 

ESTs 5.1 

ESTs 5.2 

ESTs 3.6 

EST 7.4 

Homo sapiens cDNA FU14680 fis, done NT 2.4 

gb:af12a12.s1 SoaresJestisJJHT Homo sap 7.8 

FH 1/FH2 domain-containing protein 10 

Huntingtin interacting protein E 30.6 

ESTs 8.1 

gbaq49a10.x1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence from clone 989H1 1 on 7.8 

ESTs 8.3 
ESTs, Weakly similar to M3K9_HUMAN MrTOG 7.9 

ESTs 113 

' Homo sapiens mRNA for KIAA1 771 protein, 9 

ESTs 8.1 

Homo sapiens cDNA: FU22726 fis, clone H 5.1 

hypothetical protein FU22604 14.2 

ESTs 7.9 

Homo sapiens cDNA FU13558 fis, clone PL 6.6 

EST 23.9 

ESTs 32.4 

ESTs.WeaklysimilartoALUB_HUMAN!l!l 22.8 

ESTs, Moderately similar to B34087 hypot 6.1 

EST 135.3 

ESTs 5.4 

KIAA1856 protein 5.6 

gb:ye57e05.s1 Soares fetal Over spleen 9.6 

ESTs 8 

Homo sapiens cDNA FU 1 2789 fis, done NT 1 2.8 

ESTs, Weakly similar to IDN4-GGTR14 [H.s 2.8 

Homo sapiens. SimHar to RIKEN cDNA 1700 7.1 

hypothetical protein MGC5576 7.7 . 

thymidine kinase 1 , soluble 5.3 

small nudear ribonucleoprotein polypept -53.9 

stem cell growth factor; lymphocyte seer 1 3.3 

chromosome 22 open reading frame 3 3 
Homo sapiens, clone MGC:16362, mRNA, com 1 3.3 

kynurenine 3-monooxygenase (kynurenlne 3 7.2 

NlCE-5 protein 14 

DKFZP566C243 protein 19 
gb:wq05c02.x1 NCI_CGAP_Kid12Homosapien 2.9 

ESTs, Highly similar to T46422 hypotheti 5 

palladia 17.1 

WW Domain-Containing Gene 20.9 

KIAA1415 protein 5.8 

melanoma-associated antigen recognised b 7.6 

solute carrier family 12 {sodium/polassl 6.7 

CGI-99 protein 2 

ESTs 5 
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129482 AA188185 Hs.289043 
129559 W01296 Hs.11360 
129587 H14718 Hs.11506 
129629 AK000398 Hs.11747 
5 129649 AO000092 Hs.16488 
129680 U03749 
129689 AW748482 Hs.77873 
129702 A1304966 Hs.12035 
129720 AA156214 Hs.12152 
10 130010 AA301116 Hs.142838 
130097 AL046962 Hs.14845 
130135 AA311426 Hs.21635 
130211 NM.003358HS.23703 
130242 X79201 Hs.153221 
15 130359 NMJ13449HS.277401 
130365 W56119 Hs.155103 
130448 BE513202 Hs.15589 
130455 D90041 Hs.155956 
130471 AL121438 Hs.183706 
20 130503 BE208491 Hs.295112 
130511 L32137 Hs.1584 
130542 U64675 Hs.179825 
130553 AF062649 Hs.252587 
130556 AI907018 Hs.15977 
25 130567 AA383092 Hs.1608 
130574 AF083208 Hs.16178 
130617 M90516 Hs.1674 
130667 BE246961 Hs.17639 
130693 R68537 Hs.17962 
30 130744 H59696 Hs.18747 
130757 AL036067 Hs.18925 
130880 BE514434 Hs.20830 
130944 BE382657 Hs.21486 
131046 AA321649 Hs.2248 
35 131060 AA194422 Hs.22564 
131099 AL133353 Hs.226581 
131135 NM_016569Hs.267182 
131185 BE280074 Hs.23960 
131225 H62087 Hs.31659 
40 131245 AL080080 Hs.24766 
131283 X80038 Hs.339713 
131569 AL389951 Hs.271623 
131643 AW410601 Hs.30026 
131714 AA642831 Hs.31016 
45 131722 D13757 Hs.311 
131760 X76732 Hs.3164 
131793 AW966127 Hs.32246 
131885 BE502341 Hs.3402 
131900 AA099014 Hs.231029 
50 131905 AA179298 Hs.3439 
131941 BE252983 Hs.35086 
131971 BE5671G0 Hs.154938 
132180 NMJ04460Hs.418 
132203 NM.004782HS.194714 
55 132273 AA227710 Hs.43658 
132288 N36110 Hs.305971 
132294 AB023191 Hs.44131 
132348 AW067708 Hs.170311 
132370 AW572805 Hs.46645 
60 132384 AA312135 Hs.46967 
132450 AA100012 Hs.48827 
132465 AW169847 Hs.49169 
132532 AA454132 Hs.5080 
132574 AW631437 Hs.5184 
65 132638 A179687G Hs.54277 
132718 NI^004600Hs.554 
132726 N52298 Hs.55608 



spindlin 6.7 

hypothetical protein FU14784 7.5 

Human clone 23589 mRNA sequence 6.8 

hypothefical protein FU20391 3.8 

cairetlculin 3.3 
gfcHuman chromogranin A (CHGA) gene, pro 14.1 

B7 homotog 3 2.6 

ESTs, Weakly similar to 138022 hypotheti 7.4 

APMCF1 protein 2 

nucleolar phosphoprotein Nopp34 1 .6 

forkheadbc*03A 2.8 

tubulin, gamma 1 6.1 
ESTs, Moderately similar to CEGTHUMAN C1.6 

synovial sarcoma, translocated to X chro 5.4 

bromodomain adjacent to zinc finger doma 8.5 

eukaryotic translation initiation factor 1 1 

PPAR binding protein 3.9 

N-acetyttransferase 1 (arylamine N-acety 33.6 

adducin 1 (alpha) 2.7 

K1AA0618 gene product 16.1 

cartilage oBgomeric matrix protein (pse 6.1 

RAN binding protein 2-flte 1 7.8 
pituitary tumor-transforming 1 14.4 

EmpiricaJly selected from AFFX single pr 4.7 

replication protein A3 (14kD) 7,9 
apoptosis antagonizing transcription fac 1 .2 
glutamine-fructose-6-phosphatetransamin 12.1 
Homo sapiens ubiquitin protein ligase (U 1 3.9 
ESTs 2 

POP7 (processing of precursor, S. cerevi 3.1 
protein x 0001 5.7 
kinesin-nice2 2.1 
signal transducer and activator of trans 5.4 
small Inducible cytokine subfamily B (Cy 7.4 
myosin VI 5.1 
COX15 (yeast) homolog, cytochrome coxid 7 
TBJCMso protein 3.3 
cydin B1 5.8 
thyroid hormone receptor-associated prot 7.5 
thloredoxin domain-containing 2.8 
Homo sapiens clone F19374 APO E-C2 gene 1 .3 
nucleoporin50kD 5 
HSPC182 protein 2.9 
putative DNA binding protein 2.9 
phosphoribosyl pyrophosphate amidotransf 3.4 
nucleobindin2 2.9 
Homo sapiens cDNA FU 14656 fis, done NT 7.9 
ESTs 13.7 
Homo sapiens, done MGC:15961, mRNA, com 8.7 
stomatin-iike 2 1U 
ubiquitin specific protease 1 2.3 
hypothetical protein MDS025 3.5 
fibroblast activation protein, alpha 14.7 
synaptosomal-assoctated protein, 29kD 7.8 
DKFZP586L151 protein 10 
solute carrier family 2 (fadlitated glu 9.2 
KIAA0974 protein 2 
heterogeneous nuclear ribonudeoprotein 12.5 
ESTs 28.3 
HSPC034 protein 6.1 
hypothetical protein FU12085 8.6 
WAA1634 protein 6.1 
mitochondrial ribosomal protein L16 7.1 
TH1 drosophila homolog 14 
DNA segment on chromosome X (unique) 99212.4 
Sjogren syndrome antigen A2 (60kD, ribon 3.7 
hypothetical protein MGC955 14.3 
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132731 
132744 
132773 
132798 
5 132810 
132833 
132842 
132851 
132891 
10 132941 
132972 
132980 
132994 
133016 
15 133177 
133208 
133254 
133266 
133268 
20 133285 
133390 
133391 
133540 
133594 
25 133621 
133720 
133760 
133784 
133791 
30 133797 
133822 
133850 
133865 
133881 
35 133924 
133959 
133989 
133997 
134234 
134348 
134376 
134379 
134405 
134421 
134480 
134516 
134529 
134751 
134790 
50 134806 
134850 
134859 
134971 
135181 
55 135207 
135245 
135257 
135307 
135321 
60 135354 
135400 
302276 
317781 
321114 
65 322556 
420802 
424001 



40 



45 



AI189075 Hs.301872 

AA010233 Hs.55921 

AA459713 Hs.295901 

AI026701 Hs.5716 

AB007944 Hs.5737 

U78525 Hs.57783 
NM_016154Hs.279771 

U09716 Hs.287912 

BE267143 Hs.59271 

A1817165 Hs.6120 

AA034365 Hs.288924 

AA040696 Hs.62016 

AA1 12748 Hs.279905 

AI439688 Hs.6289 



X97795 
A1801777 
A1567421 
AI160873 



Hs.66718 
Hs.6774 
Hs.273330 
Hs.69233 



AW956781 Hs.293937 
M76477 Hs.289082 
AI950382 Hs.72660 
AW103364 Hs.727 
AL037159 Hs.74619 
AW160781 Hs.172589 
NM.O04893HS.75258 
L27841 Hs.75737 
BE271766 Hs.181357 
BE622743 Hs.301064 
M34338 Hs.76244 
AL133921 Hs.76272 
D50525 Hs.699 
W29092 Hs.7678 
AB011155 Hs.170290 
U30872 Hs.77204 



066326 
X81789 



Hs.325948 
Hs.77897 



AL040328 Hs.78202 
A1824113 Hs.78281 
BE300078 Hs.80449 
AW291946 Hs.82065 
X06560 Hs.82396 
AW362124 Hs.323193 
AW067903 Hs.82772 
AU077196 Hs.82985 
NM_005000Hs.83916 
AK001571 Hs.273357 
AW411479 Hs.848 
AW630803 Hs.89497 
BE002798 Hs.287850 
AD001528 Hs.89718 
AI701162 Hs.90207 
D26488 Hs.90315 
AI097346 Hs.286049 
BE250865 Hs.279529 
N26427 Hs.9634 
AI028767 Hs.262603 
AW291023 Hs.97255 
AI743770 Hs.98368 
A1652069 Hs.98614 
AA456454 Hs.183418 
X78592 Hs.99915 
AW057736 Hs.323910 
NM_007057Hs.42650 
AA902256 Hs.78979 
BE041451 Hs.177507 
U22376 Hs.1334 
W67883 Hs.137476 



hypothetical protein MGC4840 5.9 

glutamyl-prolyl-tRNA synthetase 6.4 

KIAA0493 protein 14.6 

K1AA0310 gene product 2.5 

KIAA0475 gene product 4.2 

eukaryotic translation Initiation factor 6.1 
Homo sapiens done PP1596 unknown mRNA7.1 

lectin, mannose-bffiding, 1 6.1 

U2(RNU2) small nuclear RNA auxiliary fac 2.7 

hypothetical protein FU13222 2.1 

Homo sapiens cONA FU1 1392 Ss, clone HE 3.5 

ESTs 1.3 

clone HQ0310 PRO0310p1 17.1 

hypothetical protein FU20886 4.4 

RAD54 (S.cerevisiae)-fike 4.4 

ESTs • 5.5 

Homo sapiens, done IMAGE:3544662, mRNA, 1.3 

zinc finger protein 16.1 
ESTs, Weakly similar to FXD2_HUMAN FORKH 12.2 

GM2 gangtioside activator protein 1 0.4 

phosphatidytserine receptor 5.7 

inhibin, beta A (activin A, activin AB a 25.5 

proteasome (prosome, macropain) 26S subu 1 .7 

nudear phosphoprotein similar to S. cer 2.6 

H2A hlstone family, member Y 1 3.5 

pericentriolar material 1 6.7 

lamlnin receptor 1 (67kD, ribosomal prot 5.4 

arfaptin 1 12.1 

spermidine synthase 9.7 

retinoblastoma-binding protein 2 1 .3 

peptidylproryl isomerase B {cyclophilin 9.7 

cellular retinoic acid-binding protein 1 4.2 

discs, large (Drosophila) homolog 5 5 

centromere protein F (350/400kD, mitosin 9.1 

vesfae docking protein p1 15 1.8 

splicing factor 3a, subunit 3, 60kD 10.4 

SWi/SNF related, matrix associated, act) 2.6 

regulator of G-protein signalling 12 13 
Homo sapiens, clone IMAGE:3535294, mRNA, 1 0.3 



Interleukln 6 signal transducer (gp130, 
2\5 , «oligoadenylate synthetase 1 (40-46 
hypothetical protein MGC3222 
collagen, type XJ, alpha 1 
collagen, type V, alpha 2 
Empirically selected from AFFX single pr 
hypothetical protein FU 10709 
FK506-binding protein 4 (59kD) 
lamin&1 

- integral membrane protein 1 
spermine synthase 
hypothetical protein MGC1 1 138 
WAA0007 protein 



px19-like protein 



6.7 

5.5 

5.8 

72.9 

6.7 

6.2 

1.4 

2.8 

6.1 

1.2 

2.6 

9.1' 

13.3 

2 

14.9 



ESTs. Highly similar to C10JWMAN PUTAT1 1 .7 

ESTs 12.2 

ESTs, Weakly similar to A46010 X-Knked 7.6 

ESTs. Weakly similar to KIAA0822 protein 5.8 

ribosome binding protein 1 (dog 180kD ho 1 2.3 

cell division cyde 1 (P1TSLRE pr 5.7 

androgen receptor (dihydrotestosterone r 1 3.9 

HER2 receptor tyrosine kinase (c 5.3 

ZWIOInterador 2.8 

Golgl apparatus protein 1 5.5 

hypothetical protein 2.9 

v-myb avian myeloblastosis viral oncogen 2.3 

paternally expressed 10 (PEG10; KIAA105 7 
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425182 AF041259.HS.155040 

446999 AA151520 Hs.334822 

450701 H39960 Hs.288467 

452461 N78223 Hs.108106 

453157 AF077036 Hs.31989 



zinc finger protein 217 2.3 
hypothetical protein MGC4485 7.5 
Homo sapiens cDNA FU12280 fis, done MA 5.6 
transcription factor 4.7 
OKFZP586G1722 protein 12.1 
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TABLE 5A 

Table 5A shows the accession numbers for (hose pkeys lacking unigenelD's for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
5 designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

1 0 Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



15 Pkey CAT number Accessions 



35 



40 

122618 
125115 
120809 
45 129680 



50 

101045 
110501 
121558 
55 121911 



19346.14 
44573J 



123615 
124385 
110856 
20 120472 



25 



30 129019 44573.2 



30686.-15 AA609170 
656394J AI267847N27351 

AA992380 N33063 N21418 H79958 R21911 H79957 

A1950087 N702Q8 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 A1820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219768 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA2B3144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 A1762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AJ961530 AW629970 BE612881 AW276997 AW513601 AW512843 M044209 AW856538 AA180009 AA337499 AW961 101 
AA251669 AA251874 A1819225 AW205862 AI683338 A1858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 A1564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

A1950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 A1539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

120695 9683J AA976503 AI917802 AA953664 M404613 AA428771 BE280542 AW194691 AI927301 AI740458 
AI796100 AI935603 AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468 
AA453641 AA454061 
genbankJT97341 T97341 
genbank_AA346495 AA346495 

23162 1 U03749 NMJJ01275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122 
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043893 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 AA782478 AA910064 A1220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 A1584185 C05724 AA789023 A1686818 D54392 AI022485 M431410 AA854232 W39212 W15214 
AA894441 AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 
AA976699 AI687001 AI621 107 A1865540 AA772107 C06286 AA319661 AA405992 

entre*J05614J05614 

genbanK.H55748 H55748 

genbank u AA412497 AA412497 

genbanK_M427950 AA427950 



305217J 
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TABLE 6: Figure 6 from BRCA 001 US 
Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkeyr Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unigene number 

1 0 Unigene Title; Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 



Pkey ExAccn Unigene© UnigeneTitie 



R1 



100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5 

100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

101806 AA586894 Hs.1 12408 S100 caldum-binding protein A7(psorias 8.9 

102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 

20 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig). 2.6 

105746 AW151952 Hs.46679 hypothetical protein FU20739 1.5 

106373 AW503807 Hs.21907 histone acetyltransferase 1.8 
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 

25 119260 AK001724 Hs.102950 coat protein gamma-cop 3.2 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 15.2 

120624 AW407987 Hs.173518 M-phase phosphoprotein homofog 52 
30 120695 AA976503 gb:oq30a04.s1 MCLCGAPJ3C4 Homosapiens 46.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; K1AA1848 6.8 

121508 AA402515 Hs.97887 ESTs 28 

122607 AA453518 Hs.98023 ESTs 61.5 

122616 AA453638 Hs.161873 ESTs 107 3 

35 122618 AA453641 gb:zx48e06.s1 SoaresJestisJJHT Homo sap 31.1 

122829 AW204530 Hs.99500 ESTs B1.8 

122838 AA460584 Hs.334386 ESTs 75.3 

123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6 

124385 AI267847 gbaq49a10.x1 Stanley Frontal NB pool 2 57.1 

40 124860 R65763 Hs.101477 EST 23.9 

124930 AI076343 Hs.173939 ESTs, Weakly similar to ALU8_HUMAN HI! 22.8 

125051 T79956 Hs.100588 EST 135.3 

128781 N71826 Hs.105465 small nuclear ribonudeoprotein polypept 53.9 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

45 129347 BE614192 Hs.279869 melanoma-assodated antigen recognised b 7.6 

129689 AW748482 Hs.77873 B7homo!og3 2.6 

130503 BE208491 Hs.295112 K1AA0618 gene product 16.1 

130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1 

131046 AA321649 Hs.2248 smaD inducible cytokine subfamily B (Cy 7.4 

50 131643 AW410601 Hs.30026 HSPC1 82 protein 2.9 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y 2.7 

132180 NM 004460HS.418 fibroblast activation protein, alpha 14.7 

132370 AW572805 Hs.46645 ESTs 28.3 

132994 AA112748 Hs.279905 clone HQ0310PRO0310p1 17.1 

55 133016 A1439688 Hs.6289 hypothetical protein FU20886 4.4 

133266 A1160873 Hs.69233 zinc finger protein 16.1 

133391 AW103364 Hs.727 inhibin, beta A (acGvin A, acth/in AB a 25.5 

134169 A1690916 Hs.178137 transducer of ERBB2, 1 1.2 

134219 NM_000402Hs.80206 gtucose-6-phosphate dehydrogenase 15 

60 134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 

134529 AW41 1479 Hs.848 FK506-binding protein 4 (59kD) 2.8 

134975 R50333 Hs.92186 Lernan coDed-coil protein 2.6 
135181 BE250855 Hs.279529 px19-fiKe protein 14-9 
322556 BE041451 Hs.177507 hypothetical protein 2.9 
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Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 
124385 656394.1 AI267847 N27351 

120695 9683J AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 A1927301 AI740458 AI79610O AI935603 

AW052210 AA970201 A1633384 AA425910 AI017004 A1241295 AA402816 AA291468 
122618 305217.1 AA453641 AA454061 
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TABLE 7: Figure 7 from BRCA 001-1 US 

Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 
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Pkey: Unique Eos probeset identifier number 






ExAccn: Exemplar Accession number, Genbank accession number 






UnigeneiD: Unigene number 








Unigene Title: Unkjene gene title 








R1 : Ratio of tumor to normal breast (issue 






ORF struct info: Structural characterization of open reading frame for the sequence of the gene 




r&ey tXACcn 
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rvi 


ORF struct info 


innni mm nni9RQ 


(15.04/ HQ 


enromosome conaensaoon i 


9 ^ 


TM 


1001 14 YWV\ft 

IUU 1 11 AUiOOO 


He ft90.R9 


fhvmWvlafo evnfhafaea 

uiyiiMuyidic syiiuicuise 




other 


IUU lot UtcHOD 


U. 11QC1 

ns. 1 1 90 1 


ectonuaeaiae pyiupnospnaiase/pnospnoai 


i Q 


other 


infl14R RPIfi^dQQ 


He 9471 


KIAAnn9n nono nmHirrt 

rwv\uuzu gene pruuuu 


1 Q 
I. y 


TM 


IUU 14/ U lOOOO 




OolcUUlaSl opeClUC laCuJl L ^laoCJQin 


7.6 


other 


inniM Hfi079n 




KIAAfllM n*»no nmHurf 
rvlAAU tw 1 ycllo piUUUlA 


9.2 


other 


100163 W44671 


Hs.124 


gene predicted from cDNA with a complete 


1.6 


other 


100220 AW015534 


Hs.217493 


annexinA2 


2 


other 


100255 D38521 


Hs.112396 


K1AA0077 protein 


1.5 


other 


100271 BE160081 


Hs.256290 


S100 caldum-binding protein A11 (calgiz 


135 


other 


100275 BE242802 


Hs.154797 


K1AA0090 protein 


5.1 


other 


100323 D50920 


Hs.23106 


KlAA0130gene product 


1.9 


TM 


100335 AW247529 


Hs.6793 


platelet-activating factor acetylhydrola 


2.7 


other 


100354 NM_004341 


Hs.154868 


carbamoyl-phosphate synthetase 2, as part 


2 


other 


100372 NNL014791 


Hs.184339 


KIAA0175 gene product 


2.6 


other 


100393 084145 


Hs.39913 


novel RGD-containing protein 


3.2 


other 


100400 AW954324 


Hs.75790 


phosphatidyfinositol glycan, class C 


1.5 


other 


100418 D86978 


Hs.84790 


KIAA0225 protein 


2 


other 


100482 M65028 


Hs.81361 


heterogeneous nuclear ribonucleoprotein 


2.9 


other 


100518 NNL004415 


Hs.74316 ■ 


desmoplaWn{DPI 1 DPil) 


1.9 


other 


100666 L05424 


Hs.169610 


C044 antigen (homing function and Indian 


5.7 


other 


100667 L05424 


Hs.169610 


C044 antigen (homing function and Indian 


9 


? 


100668 L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


7.7 


other 


100678 AW502935 


Hs.740 


PTK2 protein tyrosine kinase 2 


53.2 


other 


100783 AF078847 


Hs.191356 


general transcription factor HH, polype 


6 


other 


100892 BE245294 


Hs.180789 


S164 protein 


1.7 


. ? 


100945 AF002225 


Hs.180686 


ublqultln protein ligase E3A (human papi 


1.5 


other 


100969 AA157634 


Hs.79172 


solute carrier family 25 (mitochondrial 


6.3 


other 


100988 AK000405 


Hs.76480 


ublquitin-fike 4 


11.4 


? 


100999 H38765 


Hs.80706 


diaphorase (NADH/NADPH) (cytochrome b^5 


1.6 


other 


101031 J05070 


Hs.151738 


matrix metaUoproteinase 9 (gelatinase B 


8.4 


other 


101045 J05614 




gb:Human proliferating ceQ nuclear anti 


5 


? 


101077 N99692 


Hs.75227 


Empirically selected from AFFX single pr 


2.6 


other 


101093 106419 


Hs.75093 


procollagen-lysine, 2-oxoglutarate 5-dlo 


1.4 


? 


101186 AA02O956 


Hs.179881 


core-binding factor, beta subunit 


2 


TM 


101216 AA284166 


Hs.84113 


cydln-d ependent kinase inhibitor 3 (CDK 


1.8 


other 


101228 AA333387 


Hs.82916 


chaperonin containing TCP1, subunit 6A ( 


1.7 


TM 


101247 AA132666 


Hs.78802 


glycogen synthase kinase 3 beta 


1.9 


other 


101249 L18964 


Hs.1904 


protein kinase C, tola 


1.5 


other 


101332 J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD) 


5.3 


other 


101352 AI494299 


Hs.16297 


COX17 (yeast) homolog, cytochrome c oxid 


4.2 


other 


101396 BE267931 


Hs.78996 


proliferating ceS nuclear antigen 


1.9 


TM 


101445 M21259 




gfcHuman Alu repeats in the region ff to 


1.6 


TM 


101470 NM.000546 


Hs.1846 


tumor protein p53 (Li-Fraumeni syndrome) 


2.5 


other 


101478 NMJJ02890 


Hs.758 


RAS p21 protein activator (GTPase activa 


5.5 


other 
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101483 


M24486 


Hs.76768 


procollagen-proline, 2-oxog!utarate 4-di 


2.1 


other 


101540 


J04977 . 


Hs.84981 


X-ray repair complementing defective rep 


1.6 


other 


101573 


AW248421 


Hs.250758 


proteasome (prosome, macropain) 26S subu 


5.7 


other 


101580 


NMJM2151 


Hs.83363 


coagulation factor Vlll-associated (intr 


1.8 


other 


101592 


AF064853 


Hs.91299 


guanine nucleotide binding protein (G pr 


5.6 


? 


101621 


BE391804 


Hs.62661 


guanyiale binding protein 1, interferon- 


2.4 


other 


101702 


AW504089 


Hs.179574 


proteln phosphatase 2 (formeriy 2A), reg 


1.3 


other 


101734 


M74099 


Hs.147049 


cut (DrosophilaHike 1 (CCAAT dispiacem 


2.1 


? 


101759 


M80244 


Hs.184601 


solute carrier family 7 (cationic amino 


5 


TM 


101767 


M81057 


Hs.180884 


carboxy peptidase B1 (tissue) 


14.4 


SS, 


101782 


AA306495 


Hs.1869 


phosphoglucomutase 1 


5.2 


other 


101805 


AW409747 


Hs.75612 


stress-induced-phosphoprotein 1 (Hsp70/H 


8.6 


other 


101606 


AA586894 


Hs.1 12408 


S100 calcium-binding protein A7 (psorlas 


8.9 


SS,TM 


101810 


MM 000318 


Hs.180612 


peroxisomal membrane protein 3 (35kD, Ze 


3.2 


TM 


101879 


AA176374 


Hs.243886 


nuclear autoantigenic sperm protein (his 


1.6 


other 


101911 


AA441787 


Hs.1 19689 


glycoprotein hormones, atpha polypeptide 


3U 


? 


101920 


AF182645 


Hs.8024 


IK cytokine, down-regulator of HLA 11 


1.8 


other 


101973 


U41514 


Hs.80120 


UDP^-acetyMpha-0-galactosamine:polyp 


2.4 


other 


102009 


BE245149 


Hs.82643 


protein tyrosine kinase 9 


1.3 


other 


102036 


BE250127 


Hs.82906 


CDC20 (cell division cyde 20, S. cerevi 


2 


? 


102083 


T35901 


Hs.75117 


interleukin enhancer binding factor 2, 4 


1.6 


other 


102107 


BE258602 


Ks.182366 


heat shock protein 75 


1.4 


other 


102123 


NM_001809 


Hs.1594 


centromere protein A (17kD) 


1.8 


other 


102165 


BE313280 


Hs.159627 


death associated protein 3 


4.6 


? 


102198 


AW950852 


Hs.74598 


polymerase (DNA directed), delta 2, regu 


4.4 


? 


102217 


AA829978 


Hs.301613 


JTV1 gene 


6.7 


other 


102220 


U24389 


Hs.65436 


lysosomal 


4.4 


TM 


102234 


AW163390 


Hs.278554 


heterochromatin-like protein 1 


1.9 


TM 


102260 


AL039104 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1, impor 


4.4 


other 


102302 


AA306342 


Hs.69171 


protein kinase C-iike 2 


27 


? 


102330 


BE298063 


Hs.77254 


chromobox homolog 1 (Drosophila HP1 beta 


1.5 


other 


102339 


BE378432 


Hs.95577 


cycfin-dependent kinase 4 


2.3 


TM 


102348 


U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 


2 


TM ' 


102349 


AU077055 


Hs.289107 


baculoviral 1AP repeat-containing 2 


3.2 


other 


102369 


U39840 


Hs.299867 


hepatocyte nuclear factor 3, alpha 


2 


other 


102374 


U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


6.2 


other 


102391 


AA296874 


Hs.77494 


deoxyguanosine kinase 


1.5 


TM 


102455 


U48705 


Hs.75562 


discoidin domain receptor family, member 


7 


other 


102465 


NMJJ01359 


Hs.81548 


2,4-dienoyl CoA reductase 1, mitochondri 


1.8 


SS, 


102488 


U50939 


Hs.61828 


amyloid beta precursor protein-binding p 


1.5 


? 


102489 


AL080116 


Hs.74420 


origin recognition complex, subunit 3 (y 


3.3 


other 


102494 


AI188137 


Hs.75193 


COP9homotog 


2.1 


other 


102501 


AF217197 


Hs.74562 


siah binding protein 1; FBP interacting 


3.2 


other 


102522 


BE250944 


Hs.183556 


solute carrier family 1 (neutral amino a 


2.8 


? 


102532 


AF040253 


Hs.70186 


suppressor of Ty (S.cerevisiae) 5 homolo 


5.7 


? 


102564 


U59423 


Hs.79067 


MAO (mothers against decapentapleglc, Dr 


2.3 


other 


102568 


W81489 


Hs.223025 


RAB31 , member RAS oncogene family 


5.3 ' 


other 


102580 


U60808 


Hs.152981 


COP-diacylglycerol synthase (phosphatida 


2.1 


other 


102581 


AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2 


1.6 


? 


102582 


U61232 


Hs.32675 


tubutin-spedftc chaperone e 


2.1 


other 


102617 


AW161453 


Hs.198767 


COPS (constitutive photomorphogenic, Ara 


1.8 


other 


102618 


AL037672 


Hs.81071 


extracellular matrix protein 1 


5.8 


other 


102627 


AL021918 


Hs.158174 


zinc finger protein 184 (KruppeHike) 


1.3 


- other 


102663 


NMJ102270 


H3.168075 


karyopherin (tmportin) beta 2 


1.8 


TM 


102676 


BE262989 


Hs.12045 


putative protein 


2.3 


other 


102687 


NMJXJ7019 


Hs.93002 


ubiquitin carrier protein E2-C 


4.4 


? 


102689 


U96132 


Hs.171280 


hydroxyacyi-Coenzyme A dehydrogenase, ty 


6 


? 


102696 


BE540274 


Hs.239 


forkhead box M1 


4.2 


other 


102704 


AU077058 


Hs.54089 


BRCA1 associated R(NG domain 1 


1.9 


other 


102705 


T97490 


Hs.50002 


small inducible cytokine subfamily A (Cy 


2.3 


SS.TM 


102750 


AB014460 


Hs.66196 


nth (Exoli endonudease Ill)-like 1 


1.2 


' TM 


102801 


BE252241 


Hs.38041 


pyrfdoxal (pyridoxine, vitamin B6) kinas 


6.5 


other 


102812 


U90549 


Hs.236774 


high-mobility group (nonhistone chromoso 


1.6 


other 


102827 


BE244588 


Hs.6456 


chaperonin contahlng TCP1, subunit 2 (b 


5.6 


TM 


102844 


AV653790 


Hs.324275 


WW domain-containing protein 1 


1.3 


TM 


102868 


X02419 


Hs.77274 


plasminogen activator, urokinase 


4.4 


other 


102925 


BE440142 


Hs.2943 


signal recognition particle 19kO 


1.9 


other 
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102935 


BE561850 


Hs.80506 


small nuclear ribonudeoprolein potypept 


2.4 


? 


102968 


AU076611 


Hs.1 54672 


methylene tetrahydrofolate dehydrogenase 


2.7 


other 


102983 


BE387202 


Hs.1 18638 


non-metastafic cells 1, protein (NM23A) 


3,1 


other 


102985 


U95742 


Hs.2707 


G1 to S phase transition 1 


5.2 


? 


103023 


AW500470 


Hs.1 17950 


multifunctional polypeptide similar to S 


1.6 


other 


103038 


AA926960 


Hs.334683 


CDC28 protein kinase 1 


2.5 


TM 


103060 


NMJJ05940 


Hs.1 55324 


matrix metalloproteinase 1 1 {MMP1 1 ; stro 


4.5 


other 


103080 


AU077231 


Hs.82932 


cycUn D1 (PRAD1: parathyroid adenomatos 


3.1 


other 


103089 


D31152 


Hs.1 79729 


collagen, type X, alpha 1 (Schmid metaph 


2.4 


other 


103177 


BE244377 


Hs.48876 


famesyf-dl phosphate famesyftransferase 


3.5 


other 


103178 


AA205475 


Hs,275865 


ribosomal protein S18 


9.9 


? 


103179 


NM_001777 


Hs.82685 


C047 antigen (Rh-related antigen, integr 


1.3 


other 


103181 


X69636 


Hs.334731 


Homo sapiens, clone IMAGE:3448306, mRNA. 


2 


other 


103185 


NM.006825 


Hs.74368 


transmembrane protein (63kD), endoplasmi 


1.6 


other 


103191 


AA401039 


Hs.2903 


protein phosphalase 4 (formerly X), cata 


2.5 


other 


103193 


NrvL004766 


Hs.75724 


coatomer protein complex, subunit beta 2 


2.2 


TM 


103194 


NM.004939 


Hs.78580 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 


6.3 


TM 


103206 


X72755 


Hs,77367 


monokine induced by gamma interferon 


8.8 


TM 


103223 


BE275607 


Hs.1708 


chaperonin containing TCP1, subunit 3 (g 


3 


other 


103232 


X75962 


Hs.129780 


tumor necrosis factor receptor superfami 


1.8 


other 


103238 


AI369285 


Hs.75189 


death-associated protein 


5.6 


TM 


103297 


NM 001545 


Hs.9078 


immature colon carcinoma transcript 1 


1.9 


? 


103330 


AI803447 


Hs.77496 


small nuclear ribonudeoprolein polypept 


2.5 


other 


103349 


X89059 




gb:H.sapiens mRNA for unknown protein ex 


1.6 


other 


103376 


AL036166 


Hs.323378 


coated veslde membrane protein 


1.8 


other 


103391 


X94453 


Hs.1 14366 


pyrroHne-5-carboxylate synthetase (glut 


2.3 


other 


103392 


X94563 




gbRsapiens dbi/acbp gene exon 1 & 2. 


4 


TM 


103430 


BE56409Q 


Hs.20716 


translocase of inner mitochondrial membr 


1.3 


other 


103491 


AF264750 


Hs.288971 


myeloid/lymphoid or mixed-lineage leukem 


5.7 


? 


103505 


AL031224 


Hs.33102 


transcription factor AP-2 beta (activatl 


5.1 


other 


103547 


AI376722 


Hs.180062 


proteasorne (prosome, macropain) subunit, 


9.7 


? 


103588 


NM_006218 


Hs.85701 


phosphoinositide-3-kinase, catalytic al 


2 


other 


103613 


NM_000346 


Hs.2316 


SRY (sex determining region Y)-box 9 (ca 


1.3 


? 


103621 


BE379766 


Hs,150675 


polymerase (RNA) II (DNA direded) polyp 


2 


other 


103622 


AA609685 


Hs.278672 


membrane component, chromosome 11, surfa 


2.3 


TM 


103727 


AI878883 


Hs.296381 


growth fader receptor-bound protein 2 


1.3 


other 


103754 


AI015709 


Hs.172089 


Homo sapiens mRNA; cDNA DKFZp586l2022 (f 


1.3 


other 


103780 


AA094752 


Hs.169992 


hypothetical 43.2 Kd protein 


7.6 


? 


103795 


H26531 


Hs.7367 


Homo sapiens BTB domain protein (BDPL) m 


1.3 


SSJM 


103797 


AA080912 




gb:zn04d03.ii Stratagene hNT neuron (937 


1.6 


other 


103813 


AI042582 


Hs.181271 


CGI-120 protein 


1.6 


other 


103855 


W02363 


Hs.302267 


hypothetical protein FU10330 


1.6 


other 


103886 


AK001278 


Hs.105737 


hypothetical protein RJ10416 similar to 


6.6 


TM 


104052 


NM_002407 


Hs.97644 


mammaglobin 2 


2.9 


other 


104079 


AA251242 


Hs.103238 


ESTs 


1.4 


other 


104174 


AA478984 


Hs.6451 


PRO0659 protein 


5.6 


TM 


104227 


AB002343 


Hs.98938 


protocadherin alpha 9 


1.6 


other 


104275 


AI751970 


Hs.101067 


GCN5 (general control of amino-atid synt 


5.4 


other 


104325 


BE379766 


Hs.1 50675 


polymerase (RNA) II (DNA directed) polyp 


6.4 


other 


104370 


AA324597 


Hs.21851 


Homo sapiens cDNA FU12900 fis, done NT 


1.6 


other 


104423 


R83113 


Hs.1432 


protein kinase C substrate 80K-H 


5.2 


other 


104482 


AB037762 


Hs.44268 


myeBn gene expression factor 2 


1.2 


other 


104667 


AI239923 


Hs.30098 


ESTs 


1.4 


other 


104757 


AI694413 


Hs.332649 


olfadory receptor, family 2, subfamily 


2.4 


other 


104804 


AI858702 


Hs.31803 


ESTs, Weakly similar to N-WASP [H.sapien 


1.4 


other 


104806 


AB023175 


Hs.22982 


K1AA0958 protein 


2.4 


other 


104827 


AW052006 


Hs.8551 


PRP4/STK/WD splicing factor 


10.9 


other 


104846 


AI250789 


Hs.32478 


ESTs 


5.7 


other 


104854 


AA041276 


Hs.154729 


3-phospholnositide dependent protein kin 


12.3 


? 


104867 


AA278898 


Hs.225979 


hypothetical protein similar to small G 


2.1 


other 


104871 


T78044 


Hs.28893 


Homo sapiens mRNA; cDNA DKFZp56402364 (f 


1.4 


other 


104896 


AW015318 


Hs.23165 


ESTs 


17.7 


other 


mm 


AW408164 


Hs.249184 


transcription factor 19 (SC1) 


5.1 


TM 


104916 


AW958157 


Hs.155489 


NS1-assodated protein 1 


1.8 


other 


104919 


AA026880 


Hs.25252 


prolactin receptor 


1.5 


other 


104930 


AF043467 


Hs.32893 


neurexophHin2 


2.3 


other 


104973 


NR.015310 


Hs.6763 


K1AA0942 protein 


5.1 


other 
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*.(\AQ7A 

iiwy/4 


T u05a 


HS. 278675 


bromodomain-containing 4 


1.5 


other 


104975 


AL136877 


Hs.50758 


SMC4 (structural maintenance of chromoso 


2.4 


other 




Al 199268 


Hs. 19322 


Homo sapiens, Similar to RIKEN cONA 2010 


7.3 


other 


104979 


A A AOTA1 J 

AA937934 


Hs.321062 


ESTs 


1.3 


other 


IfMGCM 

iU4yy4 


AJ49993Q 


ns.3348o5 


mitochondrial vjTP binding protein 


3.6 


7 


105009 


BE379584 


Hs.34789 


dolichyt-dlphosphoolig osaccharide-p rotei 


5.6 


other 




Ar09o15B 


Hs.9329 


chromosome 20 open reading frame 1 


J.4 


other 


105028 


AI050715 


Hs.2331 


E2F transcnpbon factor 5, p130-binding 


2.2 


other 


105041 


AB037716 


Hs.26204 


KIM 1295 pratem 


2.2 


other 


1U0U4S 


Bc242899 


U. 4 000C4 

HS. 129951 


specWe-type POZ protein 


3.9 


o 
f 


1U5U/9 


AA151342 


l_J_ 4 

Hs.12677 


CGM47 protein 


9.5 


TM 


1 05087 


A A 4 AtQQA 

AA147oo4 


Hs.9812 


Homo sapiens cDNA FU 14388 lis, clone HE 


5.7 


other 


105068 


i irnrnn 

H58589 


Hs.35156 


Homo sapiens cDNA FU1 1027 fis, clone PL 


22 


other 


105095 


Z78407 


Hs.27023 


vesicle transport-related protein 


2.2 


other 


105110 


BE387350 


Hs.33122 


KIAA1160 protein 


1.6 


other 


105126 


AW975433 


Hs.36288 


ESTs 


6.4 


? 


105127 


AA045648 


Hs.301957 


nudix (nucleoside diphosphate linked moi 


2.2 


other 


105141 


AA1 64687 


Hs.1 77576 


mannosyl {alpha-1,3-)-glycoprotein beta- 


2.8 


other 


105158 


AW976357 


IJ — dele 

Hs.234545 


hypothetical protein NUF2R 


2 


other 


105169 


BE245294 


Hs.180789 


S164 protein 


1.7 


other 


105166 


AA191512 


Hs. 28005 


Homo sapiens cDNA FU1 1309 fis, clone PL 


4.9 


SS,TM 


105254 


AA071276 


Hs.19469 


KIAA0859 protein 


2 


TM 


105281 


A A At 

AA263143 


Hs.24596 


RAD51 -interacting protein 


2.9 


? 


105288 


N99673 


Hs.3585 


ESTs, Weakly similar to AF126743 1 DNAJ 


15 


TM 


105302 


AA700122 


Hs.3355 


sentrin-specific protease 


0.2 


( 


105331 


AW270037 


Hs. 179507 


K1AA0779 protein 


1.8 


GO 

So, 


105359 


NM_01d015 


Hs.8054 


CGl-oo protein 


Q A 

0,4 


other 


105366 


8E264645 


Hs.282093 


hypothetical protein FU21918 


5.1 


other 


105373 


AW887701 


Hs.32356 


hypothetical protein FU20628 


2.6 


other 


105374 


8E242803 


Hs.262823 


hypothetical protein FU 10326 


22 


TM 


105387 


AW592146 


Hs.108636 


membrane protein CH1 


Z3 


SSJM 


105393 


AF167570 


Hs.256583 


interieukin enhancer binding factor 3, 9 


5.5 


SS, 
other 


105399 


BE386877 


i r *%*s dftd d 

Hs.334811 


Npw38-blnding protein NpwBP 


1.6 


105400 


AF1 98620 


Hs.65648 


RNA binding motif protein 8A 


1.6 


other 


105445 


AA252395 




gb2s12g10.s1 NCLCGAP_GCB1 Homo sapiens 


5.1 


? 


105507 


BE268348 


Hs.226318 


CCR4-NOT transcription complex, subunit 


1.6 


other 


105529 


A A 4 10/1/10 

AA1 13449 


HS.32471 


kiumHiAKmi . M i.:. ci ionics 

nypotnetical protein PLJ2U3b4 


4 1 


other 


105530 


AB023179 


LI- AAfA 

Hs.9059 


K1AA0962 protein 


3.5 


other 


105547 


AA2o2o40 


Hs .27445 


unknown 


9.3 


other 


105564 


BE616694 


Hs. 288042 


hypothetical protein rU 14299 


1.4 


other 


105595 


a AC7QCJC 

AA579535 


HS. 18490 


nypotneucai protein rLJ2U452 


iu.y 


IM 


105597 


AFQ54284 


Hs.334826 


splicing factor 3b, subunit 1, 155kO 


2.9 


TM 


105608 


AI808201 


1 i ofim ai 

Hs.287863 


hypothetical protein FU12475 


1.7 


7 


105610 


AA280072 


Hs.99872 


fetal Alzheimer antigen 


1.4 


other 


105617 


AK000892 


Hs.4069 


glucocorticoid modulatory element btndin 


1.7 


TM 


105620 


AW302245 


Hs.181390 


casein kinase 1, gamma 2 


5.6 


other 


105658 


AA985190 


Hs.246875 


hypothetical protein FU 20059 


9.4 


other 


105697 


AW499988 


Hs.27801 


zinc finger protein 278 


2 


TM 
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4 7 

1./ 


other 


105743 


BE246502 


Hs.9598 


sema domain, immunoglobulin domain (ig), 


2.7 


other 


105746 


AW151952 


Hs.46679 


hypoflietical protein FU20739 


1.5 


? 


105759 


AI123118 


Hs.15159 


chemokine-like factor, alternatively spl 


1.3 


other 


105771 


A1267720 


Hs.153221 


synovial sarcoma, translocated to X chro 


1.6 


other 


105620 


AA741336 


Hs.152108 


transcriptional unit N 143 


22 


other 


105826 


AA478756 


Hs.194477 


E3ubiquifinligaseSMURF2 


1.3 


other 


105856 


AI262106 


Hs.12653 


ESTs 


2.4 


other 


105858 


AF151066 


Hs.281428 


hypothetical protein 


2.9 


other 


105875 


AK001708 


Hs.32271 


hypothetical protein FU10846 


1.4 


other 


105930 


AF016371 


Hs.9880 


peptidyl prolyl isomerase H (cyclophifin 


5.3 


other 


106000 


AW194426 


Hs.20726 


ESTs 


1.7 


other 


106011 


AW081202 


Hs.12264 


Homo sapiens, clone IMAGE:2989556, mRNA, 


18 


other 


106017 


AA477956 


Hs.26268 


ESTs 


1.4 


other 


106073 


AL157441 


Hs.17834 


downstream neighbor of SON 


1.4 


other 


106078 


AA130158 


Hs.19977 


ESTs. Moderately similar to ALU8.HUMAN A 


1.6 


? 


106094 


AA533491 


Hs.23317 


hypothetical protein FU 14681 


6.9 


other 


106140 


AB006624 


Hs.14912 


KIAA0286 protein 


1.6 


other 


106271 


AA251393 


Hs.289052 


Homo sapiens, Similar to RIKEN cDNA 5430 


10.8 


? 
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107859 AW732573 Hs.47584 
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KIAA1321 protein 

higb-mobtlir/ group (nonhistone chromoso 
ESTs, Weakly similar to A54849 collagen 
cycfoiB2 

transmembrane 7 superfamily member 2 
K1AA1483 protein 

Homo sapiens cDNA: FU21487 fis, done C 

zinc finger protein 278 

Homo sapiens cDNA: FU23038 fo t clone L 

ESTs 

Homo sapiens cDNA FU10071 fis, done HE 
fibuOn 1 

tyrosylprotein sulfotransferase 2 

Homo sapiens mRNA; cDNA DKFZp564O0122 {f 

ESTs, Moderately similar to S65657 alpha 

hypothetical protein FU12549 

T1A1 cytotoxic granule-assodated RNA-bi 

ESTs 

Bd-2-related ovarian killer protein4ik 
hypothetical protein RJ23293 similar to 
centrin, EF-hand protein, 3 (COC31 yeast 
K1AA1323 protein 
hypothetical protein 

Homo sapiens, clone 1MAGE:3343149, mRNA. 
GL002 protein 
transcription factor BMAL2 
serum/glucocorticold regulated kinase 
hypothetical protein DKFZp434L1435 simD 
hypothetical protein 
ESTs 

hypothetical protein FU20727 
myeloid/lymphoid or mixed-lineage leukem 
ectonudeotide pyrophosphatase/phosphodi 
GTPase activating protein 
hypothetical protein FU20505 
GK001 protein 

Down syndrome critical region gene 2 
ESTs 

K1AA1288 protein 
ESTs 

glioma pathogenesis-related protein 

leukemia-associated phosphoproteinp18 ( 

ESTs, Moderately similar to 138759 zinc 

nudear receptor co-repressor/HDAC3 comp 

translocase of outer mitochondrial membr 

ESTs, Moderately similar to ALU7.HUMAN A 

hypothetical protein MGC4606 

Homo sapiens mRNA; cONA DKFZp586l0324 (f 

zinc finger protein 193 

TATA element modulatory factor 1 

kinesin family member 4A 

nucleolar protein (KKE/D repeat) 

ESTs 

ESTs, Moderately similar to ALU7_HUMAN A 
ESTs, Weakly similar to ALU1 JiUMAN ALU S 
potassium voltage-gated channel, delayed 
keratin 6B 

Jg superfamily receptor LNIR 

pyruvate dehydrogenase kinase, isoenzyme 

SWl/SNF related, matrix assodated, act! 

ESTs 

F-box only protein 5 

ESTs 

ESTs 

Homo sapiens cDNA FU12758 fis, clone NT 

ESTs 

ESTs 
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108647 


8E546947 


Hs.44276 


homeo box C10 


9.8 


other 


iUob95 


AB029000 


Hs.70823 


K1AA1077 protein 


7.3 


other 


106740 


AI0B9575 


Hs.9071 


progesterone membrane binding protein 


2.8 


? 


1Uoo2o 


a ivnn4f?ni 

AKQQ1693 


HS.273344 


DKFZP5&4O0463 protein 


1.9 


other 


108859 


AL 121500 


Hs. 178904 


ESTs 


1.6 


TM 


1flP071 

lUoo/2 


HQ672Q 


LI* 4 4 4COA 

ns.1 11680 


endosulfine alpha 


2.2 


other 


1 hooch 


A18Q1Z35 


11. ifllnA 

rls.46480 


ESTs 


5.4 


other 


4AGQCM 

luoo94 


A 1/11/14444 

AKOOHol 


U» £4 AC 

nS.5105 


nypouieticai protein rLJ 10569 


A 4 

4.1 


T»A 

TM 


4naoce 


A A 4 AdtCA 

AA1 49754 


nS.i95155 


Homo sapiens amino acid transport system 


5.7 


? 


4 no n no 

108962 


A A 4 C 4Tf\C 

AA1 51708 


Hs.171980 


homeo box (expressed in ES ceDs) 1 


1.7 


other 


108987 


A A 4 CUTO 

AA152178 


Hs.23467 


hypothetical protein FU 10633 


6.3 


other 


4 Annnn 

10S002 


a nAienoi 

A8028987 


Hs.72134 


K1AA1054 protein 


1.7 


other 


109011 


AA1 56542 


Hs.72127 


ESTs 


1.5 


other 


109026 


AA1 57811 




gbzo35d07.s1 Stratagene colon (937204) 


5.4 


other 


109066 


AA164293 


Hs.72545 


ESTs 


3 


other 


109101 


AW608930 


Hs.52184 


hypothetical protein FU20618 


1.6 


SS, 


109112 


AW419196 


Hs.257924 


hypothetical protein FU13782 


3.3 


TM 


109124 


AK000684 


Hs.1 83887 


hypothetical protein FU22104 


1.7 


other 


109139 


AJ 132592 


Hs.59757 


zinc finger protein 281 


2.7 


other 


109166 


AA219691 


Hs.73625 


RAB6 interacting, kinesin4ike (rabkines 


3 


TM 


109198 


BE566742 


Hs.58169 


highly expressed in cancer, rich in leuc 


2.1 


other 


109213 


NM_016603 


Hs.82035 


potential nuclear protein C50RF5; GAP-B 


5.4 


other 


109220 


AW958181 


Hs.189998 


ESTs 


5.8 


other 


109233 


AU077281 


Hs.170285 


nudeoporin 214kD (CAIN) 


5.3 


other 


109270 


N99673 


Hs.3585 


ESTs, Weakly similar to AF126743 1 DNAJ 


1.4 


other 


109273 


AA375752 


Hs.82719 


Homo sapiens mRNA; cONA DKFZp586F1822 (f 


3 


other 


109313 


AF153201 


Hs.86276 


C2H2 (Kruppel-type) zinc finger protein 


1.3 


other 


109341 


AA213506 


Hs.1 15099 


EST 


3 


? 


109391 


AL096858 


Hs.184245 


KIAA0929 protein Msx2 interacting nuclea 


1.5 


other 


109420 


H83603 


Hs.40408 


homeo box C9 


2.2 


SS, 


109426 


N30531 


Hs.42215 


protein phosphatase 1, regulatory subuni 


3.1 


TM 


109429 


A! 160029 


Hs.61438 


ESTs 


2 


? 


109445 


a a ni 

AA232103 


Hs.189915 


ESTs 


1.8 


other 


109450 


AB032969 


Hs.1 73042 


KIAA1143 protein 


3.8 


other 


109468 


NM_015310 


Hs.6763 


I/it 1 n/ijft ^_ 1. • 

KIAA0942 protein 


3.3 


other 


109478 


AW074143 


Hs.87134 


ESTs 


2 


TM 


109570 


140027 


Hs.1 18890 


gtycogen synthase kinase 3 alpha 


2.1 


other 


109662 


F02614 


Hs.27319 


ESTs 


1.4 


other 


109825 


R71264 


Hs.16798 


ESTs 


1.3 


other 


110039 


114. 4.(110 

H11938 


Hs .21907 


histone acetyttransferase 


2 


other 


110056 


A A CAOAjM 

AA503041 


1 1 _ rt-»n nnn 

HS279009 


matrix Gla protein 


2.5 


other 


110085 


M603840 


Hs.29956 


KIAA0460 protein 


1.7 


other 


110110 


T07353 


Hs.7948 


ESTs 


2.9 


other 


110129 


R51853 


Hs.226429 


ESTs, Weakly similar to ALU1 JWMAN ALU S 


1.7 


SS, 


110154 


NM_014521 


Hs.1 7667 


SH3-domain binding protein 4 


4.3 


other 


110240 


AI668594 


Hs.1 76588 


ESTs, Weakly similar to CP4YJHUMAN CYTOC 


4.3 


? 


110242 


N41744 


Hs.1 9978 


CGI-30 protein 


1.3 


other 


110259 


H28428 


Hs.32406 


ESTs, WeaWy similar to 138022 hypotheti 


2.2 


other 


110312 


□coccoqc 
Btzobsoo 


ris. 1189b 


nypouieiica] proteai hjizuo9 


0 4 


other 


110501 


H55748 




gb:yq94a01 .s1 Soares fetal fiver spleen 


6.1 


? 


110504 


H55915 


Hs.210859 


hypothetical protein FU11016 


6.1 


TM 


110525 


H57330 


Hs.37430 


EST 


6.4 


other 


110568 


AK001160 


Hs.5999 


hypothetical protein FU 10298 


1.3 


? 


110699 


T97586 


Hs.18090 


ESTs 


1.8 


other 


110705 


AB007902 


Hs.32168 


KIAA0442 protein 


1.6 


TM 


110742 


AW190338 


Hs.28029 


hypothetical protein MGC11256 


7.8 


other 


110761 


AL138077 


Hs.16157 


hypothetical protein FU12707 


2.5 


other 


110762 


BE044245 


Hs.30011 


hypothetical protein MGC2963 


9.3 


? 


110765 


AK000322 


Hs.18457 


hypothetical protein FU20315 


'5.5 


SS, 
TM 


110769 


BE000831 


Hs.23837 


Homo sapiens cDNA FU 11812 fis, clone HE 


2.1 


110799 


AI039660 


Hs.323401 


dpy-30-like protein 


1.5 


TM 


110805 


T25829 


Hs.24048 


FK506 binding protein precursor 


6.7 


TM 


110813 


AA767373 


Hs.35669 


ESTs, Moderately similar to ALU1_HUMAN A 


5.7 


other 


110820 


R33261 


Hs.6614 


ESTs, Weakly similar to A43932 mucin 2 p 


3.4 


other 


110840 


N31598 


Hs.12727 


hypothetical protein FU21610 


1.7 


TM 


110844 


AI740792 


Hs.167531 


methylcrotonoyl-Coenzyme A carboxylase 2 


1.7 


other 


110854 


BE612992 


Hs.27931 


hypothetical protein FU 10607 similar to 


4.7 


other 
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110963 AK0O2180 Hs.11449 
110981 AK001980 Hs.24284 
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111184 AI815486 Hs.243901 
111189 N67603 Hs.272130 

111216 AW139408 Hs.152940 
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111223 AA852773 Hs.334838 

111239 N90956 Hs.17230 

111285 AA778711 Hs.4310 

111299 AB033091 Hs.74313 

111312 AI523913 Hs.34504 

111318 T99755 Hs.334728 

111337 AA837398 Hs.263925 

111352 H58589 Hs.35156 

111370 AI478658 Hs.94631 

111384 N94606 Hs.288969 

111389 AK000987 Hs.169111 

111452 R02354 Hs.15999 

111486 AI051194 Hs.227978 

111549 W90638 Hs.20321 

111585 R10720 Hs.20670 

111627 R52656 Hs.21691 

111870 AB037834 Hs.18685 

111937 BE298665 Hs.14846 

111944 AW083791 Hs.21263 

111987 NMJM5310 Hs.6763 

112134 R41823 Hs.7413 

112244 AB029000 HsJ0823 

112388 R46071 Hs.301693 

112456 NMJ16248 Hs.232076 

112464 AW007287 Hs.28538 

112506 AI742756 Hs.26079 

112513 R68425 Hs.13809 

112752 AK001635 Hs.14838 

112884 AK000004 Hs.5013 

112923 T10258 Hs.5037 

112936 AW970826 Hs.6185 

112958 R61388 Hs.6724 

112966 Z44716 Hs.102548 

112978 AK000272 Hs.7099 

112995 AA737033 Hs.7155 

112996 BE276112 Hs.7165 
113047 AI571940 Hs.7549 
113049 AW965190 Hs7560 
113089 T40707 Hs.270862 
113196 T57317 

113248 T63857 

113254 AK002180 Hs.11449 

113277 AW971049 Hs.11774 

113429 AA688021 Hs,179808 

113499 AI467908 Hs.8882 

113547 H59588 Hs.15233 

113647 AA813887 Hs.188173 
113702 T97307 



gb:ot37g06.s1 SoaresJestisJIHT Homo sap 
hypothetical protein MGC13186 
DKFZP434D156 protein 
hypothetical protein RJ13187 
ESTs, Moderately similar to reduced expr 
signal transduction protein (SH3 contain 
DKF2P5640123 protein 
AOP-ribosyltransferase (NAD+; poly(ADP-r 
UDP-N^atyl^lpha-^alactDsamine:polyp 
ESTs, Moderately similar to Z195.HUMAN 2 
hypothetical protein 

Homo sapiens cDNA FLJ13289 fe, clone OV 
Homo sapiens cDNA FU12900 fis, clone NT 
Homo sapiens rnRNA; cDNA DKFZp586D1122 (f 
asporin (LRR class 1) 

Homo sapiens cDNA FU20738 fis, done HE 
ESTs. Weakly similarto S65824 reverse I 
ESTs 

KIAA1361 protein 

KIAA1866 protein 

hypothetical protein FU22087 

eukaryotic translation initiation factor 

KIAA1265 protein 

ESTs 

ESTs 

US1-interacfing protein NUDE1, rat homo 

Homo sapiens cDNA FLJ11027 fis, done PL 

brefeldin A-lnhibited guanine nudeotide 

HSCARG protein 

oxidation resistance 1 

ESTs 

EST 

ESTs, Moderately similar to ZRF1_HUMAN Z 

EST 

ESTs 

Homo sapiens mRNA for KIAA1413 protein, 
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 
suppressor of potassium transport defect 
K1AA0942 protein 
ESTs; calsyntenin-2 
K1AA1077 protein 

Homo sapiens, done IMAGE:3638994, mRNA, 
A kinase (PRKA) anchor protein 11 
Homo sapiens cDNA: FU21086 lis, clone C 
ESTs 

hypothetical protein RJ10648 

hypothetical protein RJ10773 

Homo sapiens mRNA for FU00004 protein, 

EST 

K1AA1557 protein 
ESTs 

glucocorticoid receptor DNA binding fad 

hypothetical pmieh RJ20265 

ESTs, Moderately similar to 21 15357A TYK 

zinc finger protein 259 

ESTs 

Homo sapiens mRNA for WAA1729 protein, 
ESTs 

gb:yb51a03.s1 Stratagene fetal spleen (9 

gb:yc16e01.s1 Stratagene lung (937210) H 

OKFZP5640123 protein 

protein (peptidyl-prolyl ds/trans isome 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU12187 lis, clone MA 
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113759 AW499665 


H<? 9456 


SWI/c.NIF rplal wi matrix a<^rdaf pri aril 


1.2 


other 


113777 RE?fifiQ47 


u« 1Q590 
no. luoau 


7lnr finnor nmtoln *?1 1 
tjiiu luiyci piuicui o i<j 


13.4 


other 


■ 10/00 MLJJifQOO 




hvnnlhofiral nrnfpin nKF7n7fi9R79fi 

liypUUicULdl piOUstll UtSTLyl O/lOiZO 


1 7 
l.r 




113791 AI269096 


Hs.135578 


r+nlnhraco rfl_i\l_ar*>fvL 
UilluUIaSc, UMN-ilWiiyr" 


1.3 


other 


113808 W44735 


Hs.9286 


Wnmn cnnlono rDMA* PI 19157ft fk Hnno C 


3.3 


other 


113811 BE207480 


Hs.6994 


t-lnmn canlone rOMA* PI I00MA fk rlnno H 

norno sapiens curon. rLj&&u*rt i», uune n 


3.1 


other 


113817 H13325 


Hs.332795 


hvnnlhptiral nrntp'm nKP7n7fi1fl17191 
i lypuu icikxji pjuicu i urv&p/ a tv %i \c\ 


3.2 


other 


113826 AW378212 


Hs.24809 


\\\tv\tMr\aXti*n\ nmfaln PI MfiQ9fi 

nypuuicucai ptuiciii ru iuozo 


2.3 


? 


113834 T26483 


Hs.6059 


cor-vuiiidiniiiy uumurttfto c*Uouciiuia 


11.3 


TM 


113868 W57902 


Hs.90744 


proteasome (prosome, macropain) 26S subu 


2.7 


other 


113870 AL079314 


Hs.16537 


nypouieucai protein, similar 10 \u uo;m 


6.1 


other 


113885 AW959486 


Hs.21732 


PCTe 


6.6 


other 


113923 AW953484 


Hs.3849 


h>)f%/>fhoHril nrntnln CI 1990/11 clmilartn 

nypouieucai pruiein ruJ&iuH i similar 10 


1.9 


? 


113989 W87544 


Hs.268828 


CCTe 

cols 


1.2 


other 


114022 AI539519 


Hs.120969 


Unmn can lone PI 11 15fi9 fie rlnno HP 

nomo sapiens cuiwa rui looz ns, cione nc 


5.4 


other 


114030 A1825386 


Hs.164478 


nypouicucoj proicin rui iaoa ynuidj 10 


9.4 


other 


114060 AB029551 


Hs.7910 


Dl!0fl1 anH W1 hfnrtfnn nmtain 

kiino i ana t 1 1 ouiuing proiDui 


1.8 


other 


114196 AF017445 


Hs.150926 


rucose-i -pnospnate g uanyiyiiransierase 


1.5 


other 


114226 AB028968 


Hs.7989 


imma ivho protein 


1.8 


other 


114253 BE149866 


Hs.14831 


Homo sapiens, Similar to zinc finger pro 


2.3 


other 


114262 AL1 17518 


Hs.3686 


KiAAuy/8 protein 


1.4 


TM 


114275 AW515443.comp 


Up oncii7 
nS.jUblu 


KIAA0306 protein 15.1 


114292 AI815395 


Hs.184641 


iduy ouu aesauiiasc £ 


1.9 


TM 


114309 AA332453 


Hs.20824 


ubi-oo protein 


2.4 


other 


114392 AA249590 


Hs.100748 


co i s, weawy similar ro Aicoyyo proune-r 


1.9 


other 


114407 BE539976 


Hs. 103305 


homo sapiens rnKNA, cuna uw^z^w jAdir^d (i 


1.3 


TM 


114455 H379Q8 


Hs.271o1o 


PQTc WoalfK/ cimitar In Al 1 IR HI IMAM Al 1 1 9 

Co i s, weaKiy suntiar 10 nix) o_n u i viain mlu o 




otner 


114463 AL120247 


Hs.40109 


tflAAnA79 nmtain 

naauo/^ protein 


5.3 


TM 


114464 AI091713 


Hs.106597 


Homn canfonc <5tmffar In RII^PW rOMA 
nurno sapieno, oimiidr uj r\ii\ci>i wiha i i iu 


1.3 


other 


114471 M028074 


Hs.104613 


ODAO hnmnlnn 

t\rHc nomoiug 


1.9 


? 


114480 BE066778 


Hs.151678 


UDP-N^acetyl-alpha-O-galactosarnlneipolyp 


13.4 


other 


114671 AA766268 


Hs.266273 


hvnnthptlr^il nmfp'm PI WWLR 


2 


other 


114698 AA476966 


Hs.l 10857 


nnlumoraca fRWA\ III fHMA fi\rartaA\ nnlv 


3.6 


other 


114730 AI373544 


Hs.331328 


tnfprmpriiafo filafnont nmtoin cvnmiKn 
if llcJiiicUidlc iliaHiciil piutciM o/MWiuil 


3.9 


other 


114767 AI859865 


Hs.154443 


mfnWin*imnQnniA msfnlpnanro Hofir4pnt 

IllUtlUllUIIIUoUMR? IlkaUilCJlallWJ UtJllUCJll ^o. 


1.7 


other 


114774 AV656017 


Hs.184325 


vol"/© protein 


3.2 


other 


114798 AA159181 


Hs.54900 




3.6 


other 


114860 AL157545 


Hs.42179 


bromodomaln and PHO finger containing, 3 


4.4 


other 


114895 AA236177 


Hs.76591 


KIAAnftA7 nmtoin 


7.2 


other 


114896 BE539101 


Hs.5324 


nypomeucai protein 


1.3 


other 


114911 AA236672 




gD.ZLtyiu^.si ooares ovary lurrtor indtiui n 


1.5 


other 


114930 AA237022 


Hs.188717 


COIS 


2 


ss. 


114938 AA242834 


Hs.58384 


CCTe 

Co IS 


2.9 


other 


114965 AI733881 


Hs.72472 


DKylD D1D 

DlVlr-r\ ID 


2.3 


? 


115023 AF102546 


Hs.63931 


dachshund (Drosophita) homolog 


1.3 


other 


115038 AA252360 


Hs.87968 


toft-like receptor 9 


1.6 


other 


115061 AI751438 


Hs.41271 


Homo sapiens mRNA ftjD length insert cON 


11.8 


other 


115117 AI670847 


Hs.5324 


hypothetical protein 


1.5 


other 


115206 AW183695 


Hs.186572 


ESTs 


2.5 


other 


115221 AW365434 


Hs.79741 


hypothetical protein FU10116 


1.5 


other 


115239 BE251328 


Hs.73291 


hypothetical protein FU10881 


1.3 


TM 


115242 AI368236 


Hs.283732 


ESTs, Moderately similar to ALULHUMAN A 


1.4 


other 


115278 AK002163 


Hs.301724 


hypothetical protein HJ1 1301 


1.5 


other 


115285 AW972872 


Hs.293736 


ESTs 


2.4 


other 


115291 BE545072 


Hs.122579 


hypothetical protein FU10461 


6.3 


ss, 


115400 AI215069 


Hs.89113 


ESTs 


6.7 


? 


115468 AA314349 


Hs.48499 


tumor antigen SLP-8p 


7.5 


7 


115471 AK001376 


Hs.59346 


hypothetical protein FU10514 


1.4 


TM 


115479 AW301608 


Hs.278188 


ESTs, Moderately similar to 154374 gene 


4.1 


TM 


115496 AW247593 


Hs.71819 


eukaryotic translation initiation factor 


16.3 


other 


115500 Y14443 


Hs.88219 


zinc finger protein 200 


5 


other 


115553 AJ275986 


Hs.71414 


transcription factor (SMIF gene) 


2.5 


other 


115581 AI540842 


Hs.61082 


ESTs 


6.2 


other 


115587 BE081342 


Hs.283037 


HSPC039 protein 


2.9 


other 


115590 AA399477 


Hs.67896 


7-60 protein 


5.3 


TM 


115646 N36110 


Hs.305971 


solute carrier family 2 (facilitated glu 


4.8 


? 


115652 BE093589 


Hs.38178 


hypothetical protein FU23468 


10.6 


other 
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115655 AL048269 


Hs.288544 


Homo sapiens, clone MGC: 16063, mRNA, com 


12.7 


TM 


ii5oo3 AI138785 


Hs.40507 


ESTs 


2 


other 


115676 AA953006 


Hs.88143 


ESTs 


3.1 


other 


l 15690 AA625132 


Hs.44159 


hypothetical protein FU21615 


1.7 


TM 


l^ccn 1 * ami 4 nil 

115693 AF231Q23 


Hs.55173 


cadherin, E6F LAG seven-pass G-type rece 


6.9 


other 


115715 BE395161 


Hs.1390 


proteasome (prosome, macropaln) subunit, 


1.7 


other 


115734 AI950339 


Hs.40782 


ESTs 


2.7 


TM 


115811 NMJ)15434 


Hs.48604 


DKFZP4348168 protein 


2.1 


other 


115823 A1732742 


Hs.87440 


ESTs 


2.1 


other 


IIDOJ/ Alfa/52 1 7 


Hs.42761 


ESTs 


13 


other 


115844 AI373062 


Hs.332938 


hypothetical protein MGC5370 


4.4 


other 


lloooo AW062629 


Hs.52081 


K1AA0867 protein 


7.3 


other 


115875 N55669 


Hs.333823 


mitochondrial ribosomat protein L13 


12 


other 


115941 AI867451 


Hs.46679 


hypothetical protein FU20739 


5.5 


other 


4 4CflCQ A DrtO"77CO 

115968 AB037753 


Hs.62767 


iyi AAA * ^ * 

K1AA1332 protein 


9.8 


other 


118003 BE275469 


Hs.66493 


Down syndrome critical region gene 5 


1.4 


other 


116011 AL359053 


Hs.57664 


Homo sapiens mRNA fuQ length insert cON 


2.4 


other 


116108 AA770688 


Hs28777 


H2A histone family, member L 


1.8 


other 


116134 BE243834 


Hs.50441 


CG1-04 protein 


1.4 


other 


118189 N35719 


Hs.44749 


ESTs, Moderately similar to T00358 hypot 


\2 


other 


116195 AW821113 


Hs.72402 


ESTs 


2.1 


other 


116238 AV660717 


Hs.47144 


OKFZP586NG819 protein 


17 


other 


116246 AF265555 


Hs.250646 


baculoviral IAP repeat-containing 6 


17 


other 


116262 AI936442 


Hs.59838 


hypothetical protein FU 10808 


18 


? 


116298 AI95541 1 


Hs.94109 


Homo sapiens cDNA FU 13634 fis, clone PL 


19 


other 


116318 AF097645 


Hs.58570 


deleted in cancer 1; RNA helicase HDB/DI 


5 


SS. 


116325 AI472106 


Hs.49303 


Homo sapiens cDNA FU 11663 fis, clone HE 


14 


SS, 


116336 AL1 33033 


Hs.4084 


KIAA1025 protein 


19 


? 


116339 AK000290 


Hs.44033 


dipeptidyl peptidase 8 


1.5 


other 


116350 AA497129 


11—4 O A *r~r A 

Hs.184771 


nuclear factor l/C (CCAAT-binding transc 


19 


? 


116358 AI1 49586 


Hs.38125 


mterferojvinduced protein 75, 52kD 


19 


? 


116365 N50174 


Hs.46765 


ESTs 


6.1 


other 


116368 N90466 


Hs.71109 


KJAA1229 protein 


1.6 


? 


116417 AW499664 


Hs.12484 


Human done 23826 mRNA sequence 


7.4 


other 


440J4£ AA4£4444 

116436 AA161411 


Hs.58668 


chromosome 21 open reading frame 57 


2.1 


other 


116462 AF218313 


Hs.236828 


putative helicase RUVBL 


15 


TM 


116470 AI272141 


Hs.83484 


SRY (sex determining region Y)-box 4 


2.1 


TM 


116575 AA312572 


Hs.6241 


phosphoinositide-3-kinase, regulatory su 


15 


other 


116637 AK001043 


Hs.92033 


Integrin-Iinked kinase-assodated serine 


2.7 


other 


116640 X899o4 


It. 144CCQ 

HS.2115Q3 


B-cell CLulymphoma 7A 


2.3 


other 


116700 AI800202 


Hs. 317589 


hypothetical protein MGC10765 


1.4 


other 


116705 AW074619 


LI - 4 1*1 4 1 

Hs. 12313 


hypothetical protein FU 14566 


3.4 


other 


116732 AW152225 


Hs. 165909 


ESTs. Weakly similar to I38022 hypolhetj 


2.9 


other 


116926 H73608 


Hs.290830 


ESTs 


17 


TM 


117034 U72209 


Hs. 180324 


YYI-assoctated factor 2 


3.4 


TM 


117132 AI393666 


Hs.42315 


p10-btiding protein 


5.2 


? 


117247 N21032 




gb:yx46f06.s1 Soares melanocyte 2NbHM Ho 


5.5 


TM 


117276 N71183 


Hs.121806 


Homo sapiens cDNA FU1 1971 fis. done HE 


15 


TM 


117984 AlflVH 7M 
11/ ^04 ArwUlfUl 


U c 1AT77Q 

nS. loj//y 


mo mo sapiens ouna ruiuayu lis, clone n i 


I 


other 


117367 AI041793 


Hs.42502 


ESTs 


2 


other 


117368 AI878942 


Hs.90336 


ATPase, H+transporting, lysosomal (vacu 


2.1 


? 


117382 AF150275 


Hs.40173 


ESTs 


2J 


TM 


117412 N32536 


Hs.42645 


solute carrier family 16 (monocarboxylic 


14 


other 


117557 AF123050 


Hs.44532 


diubiqultin 


3.4 


TM 


117588 N34895 


Hs.44648 


ESTs 


3.4 


? 


117745 BE294925 


Hs.46680 


CGI-12 protein 


3 


SS, 


117754 AA121673 


Hs.59757 


zinc finger protein 281 


19 


other 


117879 N54706 


Hs.303025 


chromosome 11 open reading frame 24 


18 


other 


117904 BE540675 


Hs.332938 


hypoBietical protein MGC5370 


6 


? 


117911 AL137379 


Hs.47125 


hypothetical protein FU13912 


17 


other 


117933 Y10518 


Hs.1 16470 


hypothetical protein FU20048 


17 


other 


117983 AL110246 


Hs.47367 


K1AA1785 protein 


5.4 


other 


118078 N54321 


Hs.47790 


EST 


5.2 


other 


118301 AA453902 


Hs.293264 


ESTs 


2.6 


other 


118429 AA243332 


Hs.74649 


cytochrome c oxidase subunit Vic 


2.5 


TM 


118472 AL157545 


Hs.42179 


bromodomain and PHD finger containing, 3 


4.1 


other 


118488 AJ277275 


Hs.50102 


rapa-2(rapagene) 


\2 


other 
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110509 N22617 


Hs.43228 


Homo sapiens cDNA Full 835 lis, clone nfc 


1.0 


other 


118528 AI949952 


Hs .49397 


ESTs 


7 A 

7.4 


7 


110656 Al 458020 


Hs.293287 


ESTs 


Z.3 


other 


118670 AA332845 


Hs. 152618 


ESTs, Moderately similar to ZN91 _HUMAN Z 


\2 


TM 


118698 AB033113 


I i * r f\A n-r 

Hs.50187 


KIAA1287 protein 


2.1 


TM 


118737 AA1 99686 




gb:zq75g09.r1 Stratagene hNT neuron (937 


5.2 


other 


118925 N92293 


HS.206B32 


#— ^ r_ i A _ .j „ _ t _ . *♦ »_ a| i in mi ilia k 1 A 

ESTs, Moderately similar to AUJ8_HUMAN A 


1.4 


other 


118984 AI668709 


Hs.240722 


ESTs, Moderately similar to ALU8 J1UMAN A 


3.6 


olher 


118986 AF148713 


Hs. 125830 


bladder cancer overexpressed protein 


4.9 


? 


119206 W24781 


Hs.293798 


K1AA1710 protein 


1.7 


TM 


119235 A W4 53069 


Hs.3657 


activity-dependent neuroprotective prote 


2-2 


other 


119265 BE539706 


Hs.285363 


ESTs 


1.4 


? 


119279 N57568 


Hs.48028 


EST 


25.1 


other 


119298 NM.001241 


Hs. 155478 


cyc0nT2 


1.6 


? 


119338 AI417240 


Hs.320836 


ESTs, Weakly similar to A47582 B-ceU gr 


1.3 


other 


119403 AL1 17554 


Hs. 119908 


nucleolar protein NOP5/NOP58 


6.7 


TM 


119478 A1624342 


Hs.170042 


ESTs 


2.4 


other 


119486 AI796730 


Hs.55513 


ESTs 


2.1 


other 


119513 W37933 




Empirically selected from AFFX single pr 


1.9 


other 


119601 AK000155 


Hs.91684 


Homo sapiens mRNA; cDNA OKFZp667l103 (fr 


3.7 


TM 


119602 AW675298 


Hs.233694 


hypothetical protein FU 11 350 


3 


other 


119676 AA243837 


Hs.57787 


ESTs 


1.4 


other 


119682 W61019 


Hs.57611 


ESTs 


1.2 


? 


119774 AB032977 


Hs.6298 


KIAA1151 protein 


1.8 


TM 


119780 NM_016625 


Hs.191381 


hypothetical protein 


3.1 


other 


119789 BE393948 


Hs.50915 


katlikreln 5 (KLK5; KLK-L2; stratum com 


9.2 


other 


119805 AJ223810 


Hs.43213 


ESTs, Weakly similar to lEFSJ-iUMAN TRANS 


3.6 


TM 


119818 AA1 30970 


Hs.58382 


hypothetical protein FLJ11101 


2.5 


? 


119863 AAQ81218 


Hs.58608 


Homo sapiens cDNA FU 14206 fis. clone NT 


2.7 


TM 


119905 AW449064 


Hs.1 19571 


collagen, type III, alpha 1 (Ehlers-Darrf 


2.6 


other 


119966 AA703129 


Hs.58963 


ESTs 


2.7 


other 


120132 W57554 


Hs.125019 


lymphoid nuclear protein (LAF-4) mRNA 


1.2 


other 


120206 H26735 


Hs,91668 


Homo sapiens done PP1498 unknown mRNA 


45.7 


other 


120248 AI924294 


Hs.1 73259 


un characterized bone marrow protein BM03 


1.2 


other 


120269 AW131940 


Hs. 104030 


ESTs 


9.6 


other 


120274 AA177051 




gb:hc02aO2.s1 NCLCGAP_pr3 Homo sapiens 


4.7 


other 


120280 AA190577 




gb2p52g02.s1 Stratagene Hela cell s3 93 


2.1 


other 


120296 AW995911 


Hs.299883 


hypothetical protein FU23399 


1.9 


TM 


120297 AA191384 


Hs. 104072 


ESTs, Weakly similar to Z195_HUMAN ZINC 


15.2 


other 


120324 M1 95517 


Hs. 191643 


ESTs 


5.6 


7 


120325 AA1 95651 


Hs, 104106 


ESTs 


6.5 


other 


120327 AK000292 


Hs.278732 


hypothetical protein FU20285 


16.1 


other 


120336 N85785 


Hs.181165 


eukaryotic translation elongation factor 


3 


other 


120342 AW450669 


Hs.45068 


hypothetical protein OKFZp434!143 


5.8 


other 


120345 AA210722 


Hs.10415B 


ESTs 


4.6 


SSJM 


120349 AW969481 


Hs.55189 


hypothetical protein 


16.8 


other 


120352 R06859 


Hs.193172 


ESTs, Weakly similar to 138022 hypotheti 


5.1 


olher 


120356 AF000545 


Hs.296433 


putative purinergic receptor 


28.1 


TM 


120371 AA219305 


Hs.104196 


EST 




o 


120382 AA228026 


Hs.38774 


ESTs 


4.1 


TM 


120383 AL109963 


Hs.123122 


FSH primary response (LRPR1, rat) homolo 


9.7 


TM 


120386 AW969665 


Hs.154848 


hypothetical protein DKFZp434D0127 


32.6 


other 


120388 AA232874 


Hs.104245 


ESTs 


3.2 


other 


120389 AW967985 


Hs.325572 


ESTs, Moderately similar to ALU7JHUMAN A 


21.7 


other 


120396 AA134006 


Hs.79306 


eukaryotic translation initiation factor 


12.5 


other 


120404 AB023230 


Hs.96427 


KIAA1013 protein 


7.3 


other 


120418 AW966893 


Hs.26613 


Homo sapiens mRNA; cDNA DKFZp586F1323 (I 


11.4 


olher 


120423 AA236453 


Hs.18978 


Homo sapiens cDNA: FU22822 fis, clone K 


1.9 


olher 


120472 A1950087 




gb:wq05c02a1 NClCGAPJ<ld12Homosapien 


19.4 


other 


120473 AA251973 


Hs.269988 


ESTs 


5.5 


? 


120484 AA253170 


Hs.96473 


EST 


10.4 


? 


120504 AA256837 




gb:2f84d10.s1 Soares_NhHMPu_S1 Homosapi 


4 


? 


120509 BE047718 


Hs.96545 


ESTs 


9.4 


other 


120520 AA258601 


Hs.161731 


EST 


2.4 


other 


120535 BE350244 


Hs.96547 


ESTs 


Z5 


? 


120551 AA279160 


Hs.111407 


Homo sapiens, done 1MAGE:3613029, mRNA, 


5.3 


other 


120570 AA280679 


Hs.271445 


ESTs, Weakly similar to ALU INHUMAN ALU S 


14.4 


? 



160 



WO 02/059377 



120582 8E244830 Hs.284228 2NF135-Dke protein 

120590 AW372799 Hs.125790 leudne-rich repeakantaining 2 

120596 AA282074 Hs.237323 N-acetylgfucosamine-phosphaie muiase 

120619 AW965339 Hs.1 11471 ESTs 

120624 AW4Q7987 Hs.173518 M-phase phosphoproteln homotog 

120639 AA286942 gb:zs56f05 .si NCL_CGAP_GCB1 Homo sapiens 

120648 AA287095 Hs.1403Q9 Homo salens, done IMAGE:36771 94, mRNA, 

120653 AW063659 Hs.191649 ESTs 

120668 AW969638 Hs.1 12318 6.2 kd protein 

120669 BE536739 Hs.109909 ESTs 

120695 AA9765Q3 gb:oq30a04.s1 NCLCGAP GC4 Homo sapiens 

120696 A1821539 Hs.97249 ESTs 

120713 AW449855 Hs,96557 Homo sapiens cDNA RJ12727 fis, clone NT 

120718 AA292747 Hs.97296 ESTs 

120750 A1191410 Hs.96693 ESTs. Moderately simUar to 210926QA B c 

120774 AI6O8909 Hs.193985 ESTs 

120807 AA346385 Hs.30002 SH3-contaming protein SH3GLB2; KIAA1848 

120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens 

120938 AA386260 Hs.104632 EST 

120977 AA398155 Hs.97600 ESTs 

120984 BE262951 Hs.99052 ESTs 

120985 AI219896 Hs.97592 ESTs 
121011 AA398360 Hs,97608 EST 
121026 A1439713 Hs.165295 ESTs 

121081 AA398721 Hs.186749 ESTs, Highly similar to 137550 mismatch 

121133 AA363307 Hs.97032 ESTs 

121176 AL121523 Hs.97774 ESTs 

121223 AI002110 Hs.97169 ESTs, Weakly similar to (U667H12.2.1 IH. 

121320 AA403008 Hs.301927 c6.1A 

121340 AW956981 Hs.97910 Homo sapiens cDNA FU13383 fis, done PL 

121408 AA406137 Hs.98019 EST 

121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-celi gr 

121450 AA406430 Hs.105362 Homo sapiens, done MGC:18257,mRNA l com 

121452 AW971063 Hs.292882 ESTs 

121455 H58306 Hs.15165 retinoic acid induced 14 

121457 W07404 Hs.144502 rrypothetica! protein FU22055 

121496 AA442224 Hs.97900 ESTs 

121505 AA494172 Hs.194417 ESTs 

121508 AA402515 Hs.97887 ESTs 

121513 AA416653 Hs.181510 ESTs 

121514 AA412112 gb:zt69b02.s1 Soares tesbs_NHT Homo sap 
121549 AA412477 Hs.98142 EST 

121558 AA412497 gb:zt95g12.s1 SoaresJestisJJHT Homo sap 

121577 AA411970 Hs.98096 EST 

121581 AA416568 gb:zu05c10.s1 SoaresJestisJJHT Homo sap 

121589 AD001528 Hs.89718 spermine synthase 

121594 AA626010 Hs.98247 ESTs 

121622 AA416931 Hs.126065 ESTs 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (f 

121682 AA418160 Hs.86043 Homo sapiens cONA FU 13558 fis, done PL 

121690 AV660305 Hs.110286 ESTs 

121706 U55184 Hs.154145 hypothetical protein FU1 1585 

121714 AA419225 Hs.98269 Homo sapiens cDNA FLJ1 1953 fe, done HE 

121729 AI949597 Hs.98325 ESTs 

121731 AA421041 Hs.180744 ESTs 

121744 AA398784 Hs.97514 ESTs 

121748 BE536911 Hs.234545 hypothetical protein NUF2R 

121773 AB033022 Hs.158654 KIAA1 196 protein 

121775 AA421773 Hs.161008 ESTs 

121776 AA292579 Hs.125133 hypothetical protein FU22501 
121786 AI810774 Hs.98376 ESTs 

121832 AW340797 Hs.98434 ESTs 

121836 AA328348 Hs.218289 ESTs 

121839 AA425691 Hs.191606 ESTs, Highly similar to K1AA1048 protein 

121842 AF027406 Hs.104865 serine/threonine kinase 23 

121847 AA446628 Hs.2799 cartilage linking protein 1 

121871 AW972668 Hs.293044 ESTs 

161 



• 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



121882 


AA426376 


Hs.98459 


ESTs 


5 


. other 


121911 


AA427950 




gb:zw50f02.s1 SoaresJotaLfetus_Nb2HF8_ 


7.3 


TM 


121915 


AA428179 


H3.223405 


ESTs, Moderately similar to A46010 X-iin 


2.5 


other 


121935 


AA428647 


Hs.98611 


EST 


2.3 


other- 


121983 


AA298760 


Hs. 180191 


hypothetical protein FU14904 


3.4 


other 


121985 


A1862570 


Hs.299214 


Homo sapiens, clone IMAGB2822295, mRNA, 


11.4 


other 


121995 


AA210863 


Hs.3532 


nemo-Eke kinase 


3.8 


? 


121999 


AA430211 


Hs.98668 


EST 


6.5 


other 


122009 


AW292763 


Hs,160822 


Homo sapiens cDNA: FU20863 fis, done A 


2.2 


other 


122013 


AA431085 


Hs.98706 


ESTs 


6.6 


other 


122036 


W92142 


Hs.271963 


ESTs, Weakly similar to ALU5.HUMAN ALU S 


13.1 


other 


122050 


A1453076 


H&166109 


ELAV (embryonic lethal, abnormal vision, 


9.1 


other 


122060 


AA431738 


Hs.98750 


EST 


13.1 


? 


122114 


AW161023 


Hs.104921 


ESTs 


1.5 


other 


122188 


AA398838 




gb:zt80d01.ii SoaresJestisJJHT Homo sap 


3.4 


other 


122204 


AA435936 


Hs.98842 


EST 


5.6 


other 


122246 


AA329550 


Hs.29417 


HCF-binding transcription factor Zhangfe 


5.2 


other 


122257 


AA436819 


Hs.98899 


ESTs 


5.6 


other 


122302 


AA441801 


Hs.104947 


ESTs 


5.8 


other 


122341 


AW601969 


Hs.99010 


hypothetical protein FU22263 similar to 


2 


other 


122356 


AA443794 


Hs.98390 


ESTs 


7.4 


SSJM 


122369 


AA443985 


Hs.303222 


ESTs 


12.2 


7 


122371 


M868555 


Hs.178222 


ESTs 


5 


? 


122372 


AA446008 


Hs.336677 


EST 


7.8 


? 


122378 


AB032948 


Hs.21356 


hypothetical protein DKFZp762K2015 


2.5 


? 


122405 


AA446572 


Hs.303223 


EST 


2.8 


TM 


122412 


AA446869 


Hs.119316 


ESTs 


7.4 


other 


122415 


AA446918 


Hs.99088 


EST 


1.9 


other 


122418 


AA446966 


Hs.99090 


ESTs, Moderately similar to similar to K 


6.9 


? 


122440 


AW505139 


Hs.9460 


Homo sapiens mRNA; cDNA DKFZp547C244 (fr 


2.6 


other 


122446 


AA447603 


Hs.99123 


EST 


1.8 


TM 


122448 


AA447626 


Hs.99127 


EST 


3.5 


other 


122458 


A1266159 


Hs.104980 


ESTs 


1.5 


other 


122460 


AW418788 


Hs.99148 


ESTs, Weakly similar to S43569 R01H10.6 


9.7 


other 


122464 


AA448158 


Hs.99152 


EST 


4.9 


other 


122490 


AA448349 


Hs.238151 


EST 


6.2 


? 


122492 


AA448417 


Hs.104990 


ESTs 


5.5 


other 


122502 


AA204969 


Hs.234863 


Homo sapiens cDNA FU12082 fis, done HE 


1.3 


other 


122510 


AA449232 


Hs.99195 


ESTs 


11.2 


? 


122530 


AW959741 


Hs.40368 


adaptor-related protein complex 1. sigma 


10,1 


other 


122547 


AA779725 


Hs.1 64589 


ESTs 


2.5 


SS, 


122555 


AA194055 


Hs.293858 


ESTs 


1.9 


other 


122570 


AA452578 


Hs.262907 


ESTs 


9.5 


other 


122572 


AA452601 


Hs.99287 


EST 


11 


? 


122586 


AK001910 


Hs.99303 


Homo sapiens cONA FU11048 fis, done PL 


3.4 


other 


122587 


AB040893 


Hs.6968 


WAA1460 protein 


2 


other 


122598 


AI028173 


Hs.99329 


ESTs 


1.7 


? 


122599 


AL355841 


Hs.99330 


hypothetical protein FU23588 


4.4 


? 


122602 


AA411925 


Hs.301960 


ESTs 


A 7 

4.7 


other 


122607 


AA453518 


Hs.98023 


ESTs 


61.5 


other 


122614 


AA453630 


Hs.99339 


EST 


10.7 


? 


122616 


AA453638 


Hs.161873 


ESTs 


107.3 


? 


122617 


AI681535 


Hs.148135 


serine/threonine kinase 33 


121.4 


other 


122618 


AA453641 




gb:zx48eQ6.s1 Scares Jestis.NHT Homo sap 


31.1 


SS, 


122622 


AA453987 


Hs.144802 


ESTs 


5.6 


other 


122717 


AA456859 


Hs.178358 


ESTs 


8.5 


SS, 


122762 


AI376875 


Hs.105119 


ESTs 


10.4 


other 


122829 


AW204530 


Hs.99500 


ESTs 


81.8 


? 


122834 


AA461492 


Hs,99545 


Homo sapiens cDNA FLM0658 fis, done NT 


3.7 


? 


122836 


AA460581 


Hs.290996 


ESTs 


4.6 


other 


122837 


AA461509 


Hs.293565 


ESTs. Weakly similar to putative p150 [H 


2.7 


TM 


122838 


AA460584 


Hs.334386 


ESTs 


75.3 


other 


122854 


AA600235 


Hs.9625 


NIMA (never in mitosis gene a)-related k 


7.8 


other 


122856 


AI929374 


Hs.75367 


Src-like-adapter 


5.8 


other 


122861 


AA335721 


Hs.1 19394 


ESTs 


1.3 


other 


122866 


BE539656 


Hs.283705 


ESTs 


42 


other 


122868 


AF005216 


Hs.1 15541 


Janus kinase 2 (a protein tyrosine klnas 


5.3 


other 
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100Q7n A\AWT7C040 


H. 04Q710 

HS.Olo/22 


Homo sapiens cONA: FLJ21765 fls, clone C 


9.9 


? 


A\A/nA11QA 
UMUL /W¥U010y4 


Lie 07 Ifi** 

nS.y/ lUo 


COTe 

coTS 


5.3 


other 




US. 1/0034 


cblS 


13.9 


other 


12290/ AA470074 


Hs.1 69895 


ESTs 


11.5 


other 


IZ^yiti AA470140 


Hs.229170 


EST 


1.7 


TM 


lOOAO* t Amines 

122981 AA478951 


Hs. 105629 


ESTs 


5 


other 


123013 AW968324 


Hs. 17384 


ESTs 


15.4 


other 


l*V3n4£ AiAiTion^T 

123016 AW33BG67 


Hs.323231 


Homo sapiens cONA FU1 1946 fis, clone HE 


2.8 


other 




HS.44054 


riinein (GSK3B interacting protein) 


8.7 


other 


4 0^070 Alooncnrt 

i2oU/2 A13B26QQ 


Hs.104308 


ESTs, Weakly simBar to K1AA1395 protein 


8.8 


other 


1/3082 AA485360 


Hs.105661 


ESTs 


4 


? 


I'MAflQ A 11/1 ICO 

123088 AI343652 


HS.1 05667 


ESTs 


3.8 


other 


lo^nn AA4Q&0KC 


ns.iyjoiu 


CCT 

EST 


7.4 


other 


123114 o £304942 


Hs.26584o 


myomegaBn 


2.8 


? 




n. 074 70C 

HS.z/1/yo 


ESTs, WeaWy similar to 138022 hypotheti 


2.4 


other 


400400 Ainc^coo 
123132 AJ0o1582 


HS.324179 


Homo sapiens cDNA FU 12371 fis, done MA 


15.6 


TM 


123138 AW451999 


Hs. 194024 


ESTs 


5.2 


other 


<00<AQ AI70VI47O 

AJ/041/9 


Lie 4ACC7C 

ns.105o/o 


ESTs 


23.8 


TM 
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? 


428477 


AW500533 


Hs.1 1482 


splicing factor, aio/inine/serine-rich 11 


1.7 


other 


437562 


AB001636 


Hs.5683 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 


2.4 


other 


438449 


AK001333 


Hs.6216 


Homo sapiens hepatocellular cardnoma-as . 


3.8 


other 


441560 


F13386 


Hs.7888 


Homo sapiens done 23736 mRNA sequence 


5.6 


other 


445580 


AF167572 


Hs.12912 


skbl (S. pombe) homolog 


2 


TM 


446999 


AA151520 


Hs.334822 


hypothetical protein MGC4465 


7.6 


other 


447111 


AI017574 


Hs.17409 


cysteine-rich protein 1 (intestinal) 


^2 


other 


447778 


BE620592 


Hs.71190 


ESTs, Weakly similar to S16506 hypotheti 


2.9 


other 


448873 


NM_003677 


Hs.22393 


density-regulated protein 


1.8 


other 


449687 


W68520 


Hs.331328 


intermediate filament protein syncoilin 


5.9 


other 


450701 


H39960 


Hs.288467 


Homo sapiens cDNA FU 12280 fis. done MA 


5,7 


other 


450703 


M011202 


Hs.184771 


nudear factor t/C (CCAAT-binding transc 


1.4 


other 



172 



WO 02/059377 



452461 N78223 
452511 8E403178 
453157 AF077036 
453658 BE541906 
100685 AA328229 
100690 AA383256 
100833 AF135168 
100850 AA836472 
101161 NMJXJ6262 
102481 U50360 
102831 AA262170 
103549 BE270465 
103749 AL135301 
104331 AB040450 
104532 AI498763 
104563 AL1 17403 
105032 AA127818 
105039 AA907305 
106531 AA454036 
106977 AL043152 
107298 N95657 
108717 AA122393 
110018 AW579842 
110330 AI288666 

111391 NMJJ03896 

111392 W46342 
113554 AW503990 
113722 AV653556 
115008 AK001827 
115062 AA253314 
115121 AI634549 
117881 AF161470 
119075 M10905 
119615 AL034423 
120253 AA131376 
125006 BE065136 
127609 X80031 
128868 AA419008 
128891 F34856 
126959 AI580127 
129209 R62676 
129449 A1096988 
129453 AW974265 
129629 AK000398 
129917 M30773 
129922 AF042379 
129989 AB015856 
130182 BE267033 
130365 W56119 
130471 AL121438 
130542 U64675 
130586 AB007891 
130768 AF258627 
130992 BE398091 
131047 H23230 
131135 NM_016569 
131339 AF058696 
131760 X76732 
131774 BE267158 
131853 AI881917 
131881 AW361018 
131887 W17084 
132031 AF193844 
132192 AA206153 
132203 NNL004782 
132240 AB018324 
132348 AW067708 



Hs.103106 

Hs.285165 

Hs.31989 

Hs.87819 

Hs.184582 

Hs.1657 

Hs.108802 

Hs.297939 

Hs.37044 

Hs.80917 

Hs.78793 

Hs.8768 

Hs.279862 

Hs^03013 

Hs.306189 

Hs.36475 

H&8832 

Hs.50421 

Hs.6820 

Hs.70811 

Hs.104557 

Hs.16621 

Hs.225939 

Hs.325081 

Hs.142442 

Hs.184411 

Hs.87889 

Hs.154103 

Hs.88155 

Hs.260622 

Hs.287820 

Hs.75875 

Hs.326401 

Hs.145696 

Hs.530 

Hs.106730 

Hs.292457 

Hs.107381 

Hs.17820 

Hs.111554 

Hs.1 11632 

Hs.11747 

Hs.278540 

Hs.13386 

Hs.247433 

Hs.192853 

Hs.155103 

Hs.183706 

Hs.179825 

Hs.16349 

Hs.211562 

Hs.74316 

Hs.22481 

Hs.267182 

Hs.25812 

Hs.3164 

Hs.169474 

Hs.3321 

Hs.3383 

Hs.332848 

Hs.3758 

Hs.4209 

Hs.194714 

Hs.42676 

Hs.170311 



transcription factor 

Homo sapiens cDNA FU20845 fis, done AD 

DKFZP58661722 protein 

Homo sapiens, clone MGC:2492, mRNA, comp 

ribosomal protein L24 

estrogen receptor 1 

N-ethylmaJeimide-sensitive factor 



peripheral 

gb:Human calcium. calmoduCn-dependent p 

adaptor-related protein complex 3, sigma 

protein kinase C.zeta 

hypothetical protein RJ 10849 

cdk Inhibitor p21 binding protein 

hypofteticat protein RJ12748 

DKFZP434F1735 protein 

gb:zl12a02.s1 Soaresj>regnanOiterus_NbH 

ESTs 

ESTs 

KlAA0203gene product 

ESTs, Moderately simflar to YOJ1.CAEEL H 

hypothetical protein FU20516 

hypothetical protein FU10697 

DKFZP434I116 protein 

slalyltransferase 9 (CMP-NeuActactosylc 

Homo sapiens, clone IMAGE:365968Q, mRNA, 

HP1-BP74 

albumin 

hellcase-moi 

LIM protein (similar to rat protein kina 
ESTs 

butyrate-induced transcript 1 
Gbronectinl 

ubiquHin-conjugatlng enzyme E2 variant 
fibroblast growth factor 128 
splicing factor (CC1 .3) 
collagen, type IV, alpha 3 (Goodpasture 
chromosome 22 open reading frame 3 
Homo sapiens, done MG£16362, mRNA, com 
hypothetical protein FU11200 
Rho-assodated, coiled-coil containing p 
ADP-ribosyiation factor-like 7 
Lsm3 protein 

hypothetical protein FU20391 

protein phosphatase 3 (formerly 2B), reg 

gamma-tubulin complex protein 2 

activating transcription factor 6 

ubiquitin-conjugafing enzyme E2G 2 (homo 

eukaryotic translation initiation fador 

adducin 1 (alpha) 

RAN binding protein 2-like 1 

K1AA0431 protein 

ATP-binding cassette, sub-family A (ABC 1 
desmoplakin(DPI, DPI!) 
ESTs, Moderately similar to A46010 X-tin 
TBX3-iso protein 

Nijmegen breakage syndrome 1 (nibrin) 
nudeoblndin 2 
DKFZP586J01 19 protein 
ESTs, Highly similar to 1RX1_HUMAN IROQU 
upstream regulatory element binding prot 
SWl/SNF related, matrix associated, acti 
COP9 complex subunit 7a 
mitochondrial ribosomal protein 137 
synaptosomaJ-associated protein, 29kD 
KIAA0781 protein 

heterogeneous nuclear ribonudeoprotein 
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132528 
132571 
132726 
132863 
133016 
133053 
133197 
133240 
133266 
133285 
133383 
133540 
133784 
133791 
133850 
133859 
133881 
134208 
134403 
134724 
134806 
134859 
135193 



T78736 

AW674699 

N52298 

BE268048 

AI439688 

AI065016 

AI275243 

AK001489 

AI160873 

M76477 

SE313555 

AL037159 

BE622743 

M34338 

W29092 

U86782 

U30572 

NM_000288 

AA334551 

AF045239 

AD001528 

D26488 

X95525 

AA2430O7 

T70541 

X57766 

S66431 

AA453483 

R63925 

AA173417 

AA280588 

AA504223 

AA609996 

F02907 

AA480103 

AA024664 

AA251776 

AA399047 

N34059 

U95367 

AA490899 

T54762 

Z41963 

AA521186 

AA400195 

AA045083 

AA099589 

W85712 

W45728 

U61232 

AA425154 

T39176 

AA496000 

W38150 

T96595 

AA227463 

R46Q25 

AA233177 

AA338760 

AA412106 

L47276 

D82307 

AA293568 

R37778 

AA250843 

W49521 

D80000 



Hs.50758 SMC4 (structural maintenance of chramoso 7,4 

Hs.5169 suppressor of G2 allele of SKP1, S. cere 6.9 

Hs.55608 hypothetical protein MGC955 14.3 

Hs.236494 RAB 10, member RAS oncogene family 10.3 

Hs.6289 hypothetical protein BJ20886 4.4 

Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 1.8 

Hs.180201 hypothetical protein aJ20671 1.8 

Hs.242894 ADP-ribosylation factor-like 1 1.8 

Hs.69233 zinc linger protein 16.1 

Hs.269082 GM2 ganglloside activator protein 10.4 

Hs.7252 KIAA1224 protein 1.5 

Hs.74619 proteasome (prosome, macropatn) 26S subu 1.7 

Hs.301064 arfaptin 1 12.1 

Hs.76244 spermidine synthase 9.7 

Hs.7678 cellular retlnolc acid-binding protein 1 ' 4.2 

Hs.1 78761 26S proteasome-assoclated pad1 homolog 2.2 

Hs.77204 centromere protein F (35Q/400kD, mitosin 9.1 

Hs.79993 peroxisomal biogenesis factor 7 3.2 

Hs.82767 sperm specific antigen 2 1.4 

Hs.321576 ring finger protein 22 1.4 

Hs.89718 spermine synthase 2.6 

Hs.90315 KJAA0007 protein 13.3 

Hs.96103 TATA box binding protein (TBPHssociate 3.1 

ESTs 1.6 

ESTs 2.5 

Human stromerysin-3 mRNA 4.5 

Homo sapiens done 23592 mRNA sequence 3. 1 

ESTs 4.6 

ESTs 1.4 

ESTs 1.9 

ESTs 2.2 

ESTs Highly similar to CHROMOSOME 2.4 
ESTs Highly similar to Surf-4 protein [MmusculusJ 5.5 

ESTs " 2.3 

ESTs Weakly similar to HI! ALU SUBFAMILY J 2.8 

Human NAOHrubiquinone oxidoreductase subunit 6.2 

ESTs 2.3 

ESTs 2.4 

EST-RCJM059 3.3 
Human GABA-A receptor pi subunit mRNA complete cds 

* ESTs 3.3 

ESTs 2.9 
Homo sapiens HP protein (HP) mRNA complete cds 1 .3 

ESTs 1.6 

ESTs 1.3 
VITAMIN K-OEPENDENT GAMMA-CARBOXYLASE 
Homo sapiens mRNA for GDP dissociation inhibitor beta 
ESTs Weakly similar to PROCOLLAGEN ALPHA 2(IV 

ESTs Highly similar to HETEROGENEOUS 3.7 
Human tubulln-folding cofactor E mRNA complete cds 



ESTs 



5.3 
2.6 
1.9 
1.7 
1.8 



ESTs Weakly similar to ZK1058.4 [C.elegans] 
ESTs 

EST-RCJV38150 
EST-RCJ96595 
ESTs Weakly similar to No definition line found [C.elegans] 

ESTs 2.8 

ESTs 2 

ESTs 1.3 

ESTs 6.2 

EST-L47276 3.4 
ESTs Weakly similar to TH1 protein [D.melanogaster] 

ESTs * 1.5 

ESTs 2.4 

Interferon regulatory factor 5 14.6 

Human prolyl 4-hydroxy!ase alpha (II) subunit 6.5 

Human mRNA for KIAA0178 gene partial cds 2 
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? 

other 
? 

other 
other 

SS.TM 
other 
other 
other 

SS, 
? 

other 
other 
other 
SS, 
other 
other 
other 
other 

other 
other 
other 
? 
SS, 
other 
other 
TM 
other 
other 
other 
other 
? 

TM 
TM 
other 
other 
other 
other 
1.7 
other 

? 

? 

TM 

other 
25 
1.6 
2.6 

other 
2.1 

other 
SS.TM 
SS, 

? 

TM 

1.9 



? 

other 
other 
11.4 
other 
other 
? 
? 

other 
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other 

TM 

TM 

other 
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R99978 


ESTs Weakly similar to line-1 protein 0RF2 [hUapiens] 


6.1 


AA1 95036 


Human Ro/SSA ribonucleoprotein homolog (RoRet 5.3 


? 


Z38501 


ESTs Weakly similar to PROBABLE E5 


1.4 


other 


U37547 


Human IAP homolog B (MJHB) mRNA complete cds 3.2 


other 


AA479961 


ESTs 


1.7 


other 


X57579 


Inhibln beta A (activin A activin AB alpha polype) 


atide) 


15.8 


AA449071 


ESTs 


13 


TM 


N51855 


ESTs Moderately similar to NAD{+) ADP- 


1.3 


other 




ESTs Weakly similar to F28F8.3 {Celegans] 


3.2 


other 


AA355201 


ESTs 


\2 


SSJM 


N78717 


HUapiens mRNA for translin 


1.5 


? 


N73808 


ESTs 


5 


? 


U86782 


Human 28S proteasome-assoclated pad1 


2.2 


other 


AA234817 


ESTs 


1.3 


other 


D13666 


Homo sapiens mRNA for osteoblast speciOc 


7.5 


SS, 


AA236177 


ESTs 


7.1 


? 


U50648 


Protein kinase interferorHnduclble double 


4.1 


? 


M28211 


Homo sapiens GTP-bindhg protein (RAB4) 


2.9 


other 


AA446949 


ESTs 


2.2 


other 


W03007 


ESTs 


1.2 


other 


W61011 


ESTs 


1.2 


other 


W87544 


ESTs 


1.2 


other 


X02751 


Neuroblastoma RAS viral (v-ras) oncogene homolog 1.2 


? 


Z14077 


YY1 transcripfion factor 


U 


other 


Z38839 


ESTs 


1.2 


? 


AA410894 


ESTs 


1.7 


other 


AA504499 


ESTs Highly similar to probable chloride channel 3 |H.sap 


1.3 
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TABLE 7A 



5 

10 
15 
20 



Table 7 A shows the accession numbers for those pkeys lacking unigenelD's for Table 7. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

102481 31281.-28 U50360 

105032 genbankJ\A127818 AA127818 

409487 1134778.1 H 1 9886 AW402806 T1 0231 
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TABLE 8: Figure 8 from BRCA 001-1 US 

Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unigene number 

Unigene Tide: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 

R2: Ratio of tumor to normal breast tissue 

Pkey ExAccn- UnigenelD Unigene Title 



100075 AF152333 
100229 AV652249 
100262 D38500 
100271 BE160081 
100355 AI907114 
100522 X51501 
100552 AA019521 
100599 X77343 
100676 X02761 
100690 AA383256 
100895 U01351 
101046 K01160 
101086 AA382524 
101148 NM_002923 
101161 NM_006262 
101201 L22524 
101212 AI186220 
101441 AW468397 
101447 M21305 
101469 AA310162 
101567 M33552 
101600 BE561617 
101624 M55998 
101674 NM.002291 
101861 AA350659 
101977 AF1 12213 
102193 AL036335 
102199 AA334592 
102304 AF015224 
102345 NM.003480 
102457 NMJJ01394 
102534 U96759 
102541 AI379954 
102827 BE244588 
102962 R50032 
102991 AW293542 
103119 X63629 
103175 X69089 
103286 D38616 
103319 X83492 
103372 BE536700 
103419 T34708 
103471 Y00815 
103546 Z14244 



Hs.284160 

Hs.180107 

Hs.278468 

Hs^56290 

Hs.71465 

Hs.99949 

Hs.301946 

Hs.334334 

Hs.287820 

Hs.1657 

Hs.75772 

Hs.250959 

Hs.78944 

Hs.37044 

Hs.2256 

Hs.83164 

Hs.100000 

Hs.169248 
Hs.56729 
Hs.1 19192 

Hs.82124 

Hs.83347 

Hs.184062 

Hs.313 

HsJ9914 

Hs.46452 

Hs.300946 

Hs.2359 

Hs.198307 

Hs.79025 

Hs.6456 

Hs.159263 

Hs.75309 

Hs.2877 

Hs.79227 

Hs.54941 

Hs.82359 

Hs.4888 

Hs.272927 

Hs.75216 

Hs.75752 



protocadherin gamma subfamily B, 4 
polymerase (DNA directed), beta 
posfrneiotic segregation increased 2-8ke 
S100 calcium-binding protein A11 (cakjiz 
squalene epoxidase 
prolacttn-induced protein 
lysosomal 

transection factor AP-2 alpha (activat 
fibronectin 1 
estrogen receptor 1 

nuclear receptor subfamily 3, group C, m 
NMJX)2122:Homo sapiens major histocompatl .7 
histatin 1 

regulator of G-proteln signalling 2, 24k 
peripheral 

matrix metaltoprateinase 7 (MMP7; uterin 
collagen, type XV, alpha 1 
S100 calcium-binding protein A8 (calgran 
gfcHuman alpha satellite and satellite 3 
cytochrome c 



H2A histone family, member Z 
gb:Human atpha-1 collagen type I gene, 3 
laminin, beta 1 

anglo-associated, migratory cell protein 
putative Rab5-lnteracting protein 
secreted phosphoprotein 1 {osteopontin, 
lumlcan 

mammaglobin 1 

Microfibril-associated glycoprotein-2 
dual specificity phosphatase 4 
von Hippel-Undau binding protein 1 
K1AA0096 protein 

chaperonin containing TCP1, subunit 2 (b 
collagen, type VI, alpha 2 
eukaryoOc translation elongation factor 
cadherin 3, type 1, P-cadherin (placenta 
myomesin (M-protein) 2 (165kD) 
phosphorylase kinase, alpha 2 (river) 
tumor necrosis factor receptor superfaml 
seryHRNA synthetase 
Sec23 (S. cerevislae) homotog A 
protein tyrosine phosphatase, receptor t 
cytochrome c oxidase subunit VHb 
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R1 


R2 


1 


3.8 


1.7 


5.3 


0.8 


4.8 


3.2 


2.3 


3.3 


1.4 


11.9 


0.4 


3.8 


1.2 


9.4 


9.4 


3 


7.8 


4.4 


4.4 


1 


3.9 


It1.7 


4 


0.8 


4.1 


M 


12 


3.1 


1.1 


4.4 


0.6 


3.1 


3.4 


0.9 


4.2 


29.9 


0.3 


0.8 


4.9 


1 


5.9 


2.8 


4 


3.1 


1.7 


1.5 


4.1 


3.1 


1.4 


1.3 


6.9 


1.9 


4.9 


22 


3.8 


4.2 


0.7 


1.1 


4.2 


4.5 


0.5 


1.4 


4.2 


0.9 


3.9 


1.5 


10.9 


2.2 


6.2 


5.6 


5.7 


3.7 


0.5 


1.3 


4 


1.3 


3.8 


0.8 


4.6 


0.9 


8 


1.1 


5.1 


3.7 


1.2 


0.9 


4.4 
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103658 


NMJJ0008B 


Hs.172928 collagen, type I, alpha 1 


3.2 


3 


103758 


M084874 


gb:zn13e04.r1 Stratagene hNT neuron {937 


0.9 


10 


103774 


H241B5 


Hs.92918 hypothetical protein 


1.9 


15.9 


103821 


M095971 


Hs.198793 Homo sapiens cDNA: FU22A63 fis, clone H 


1.2 


3.9 


103869 


BE439604 


Hs.24322 ATPase, H+ transporting, lysosomal {vacu 


1.4 


3.9 


103980 


AW130242 


Hs.293476 hypothetical protein FKSG44 


1.6 


4.1 


104054 


AK001913 


Hs.7100 hypothetical protein 


1.5 


4.3 . 


104115 


AF183810 


Hs.26102 opposite strand to trichorhlnophalangeal 


7 


7 


104189 


AB040927 


Hs.301804 KIAA1494 protein 


2 


4.6 


104230 


AB002347 


Hs.15303 K1AA0349 protein 


0.7 


4.5 


104278 


AW583693 


Hs. 109253 N-temninal acetyltransferase complex ard 


3.3 


3.3 


104295 


AW365522 


Hs.103657 hypothetical protein PR02219 


2.3 


4.2 


104319 


AW804296 


Hs.9950 Sec61 gamma 


3.1 . 


7 


104425 


AF283775 


Hs.35380 x 001 protein 


4 


1.3 


104432 


X51501 


Hs.99949 prolactin-induced protein 


3.8 


0.6 


104464 


AW966728 


Hs.54642 methionine adenosyltransferase 11, beta 


0.8 


6.7 


104479 


AK001731 


Hs.106390 Homo sapiens mRNA; cDNA OKFZp586H0924 (f 


1.7 


104505 


W94824 


Hs.11565 RIKENcDNA 2010100012 gene 


2 


7.5 


104592 


AW630488 


Hs.325820 protease, serine, 23 


1.9 


7.4 


104613 


AF123303 


Hs.24713 hypothetical protein 


1.1 


6.3 


104636 


R82252 


Hs.106106 protein kinase (cAMP-dependent, catalyti 


1.2 


4 


104782 


AW270555 


Hs.171774 hypothetical protein 


1.4 


3.9 


104792 


AA960961 


Hs.305953 zinc finger protein 83 (HPF1 ) 


1.5 


42 


104848 


AA305351 


Hs.274369 uncharaderized hypothalamus protein HAR 


1.1 


4.1 


104849 


AI279065 


Hs.241507 ribosomal protein S6 


1.3 


4.6 


104850 


AL133035 


Hs.8728 hypothetical protein DKFZp434G171 


1.2 


3.6 


104852 


W70164 


Hs.20107 ESTs 


0.8 


4.2 


104861 


AA058630 


Hs.29759 RNA POLYMERASE I AND TRANSCRIPT RELEASE 


1.7 


104873 


W03831 


Hs.20597 host ceB factor homotog 


0.8 


5.4 


104891 


W44626 


Hs.30627 ESTs 


0.7 


6.6 


104920 


AW955089 


Hs.306083 Novel human gene mapping to chomosome 22 


1 3.9 


104926 


BE298808 


Hs.33363 DKF2P434N093 protein 


3.3 


3.3 


104952 


AW076098 


Hs.74316 desmoptekin(DPI,DPII) 


1.2 


3.7 


104963 


AB029020 


Hs.173694 KIAA1097 protein 


1.1 


5.5 


104977 


AI392640 


Hs.18272 amino acid transporter system A1 


3.2 


1.4 


105030 


BE613061 


Hs.337772 Homo sapiens, Similar to RiKEN cDNA 0610 1.6 


11.4 


105035 


N39760 


Hs.8859 Homo sapiens, Similar to RIKEN cDNA 5830 1 .5 


7.2 


105068 


BE410438 


Hs.9006 VAMP (vesicle-associated membrane protei 


1.1 


3.5 


105159 


AF146277 


Hs.265561 CD2-associated protein 


1.2 


10 


105178 


AA313825 


Hs.21941 AD036 protein 


3.6 


8.3 


105182 


BE407961 


Hs. 1 8271 golgi phosphoproteln 3 


1.7 


6.8 


105274 


AI554929 


Hs.281866 ATPase, H> transporting, lysosomal (vacu 


1.1 


3.7 


105303 


BE243327 


Hs.1 82626 chromosome 22 open reading frame 5 


1.5 


4 


105413 


AI015709 


Hs.172089 Homo sapiens mRNA; cDNA DKF2p586l2022 (f 


1.5 


105426 


W20027 


Hs.23439 ESTs 


4.3 


2.9 


105432 


W03516 


Hs.76698 stress-assodated endoplasmic reticulum 


1.5 


5 


105443 


AA252372 


Hs.12144 KIAA1033 protein 


1.2 


3.6 


105483 


AL137257 


Hs.23458 Homo sapiens cDNA: FU23015 fis, clone L 


1.7 


15.8 


105492 


A1805717 


Hs.289112 CGI-43 protein 


2 


4.8 


105495 


AL037715 


Hs.28785 microfibrillar-associated protein 3 


1.3 


3.9 


105539 


AB040884 


Hs.109694 KIAA1451 protein 


2.7 


11.4 


105594 


AB024334 


Hs.25001 tyrosine 3-rnonooxygenase/tryptophan 5-mo 


1.3 


6.1 


105623 


BE504200 


Hs.30127 hypothetical protein 


1.7 


4.5 


105807 


AA788946 


Hs.16869 ESTs, Moderately similar to CA1C RAT COL 


3.9 


24.6 


105812 


BE614149 


Hs.20814 CGK27 protein 


1.8 


3.6 


105823 


AI559444 


Hs.293960 ESTs 


1.9 


6.6 


105831 


AA329449 


Hs.247302 twisted gas trutation 


1.5 


4.3 


105851 


AI827976 


Hs.24391 hypothetical protein FU13612 


3.8 


1.9 


105879 


BE392914 


Hs.30503 Homo sapiens cDNA FU1 1344 fis, clone PL 


1.7 


4 


105918 


AW028485 


Hs.26136 hypothetical protein MGC14156 


1.7 


7,4 


105939 


AL137728 


Hs.12258 Homo sapiens mRNA; cDNA DKFZp434B0920 (f 


1.2 


105941 


AB033075 


Hs. 1 0669 development and differentiation enhancin 


1.3 


4.6 


105969 


AB030656 


Hs.17377 coranin, actin-binding protein, 1C 


1.1 


5.9 


105990 


A1690586 


Hs.29403 hypothetical protein FU22060 


2 


4.6 


106012 


AI240665 


Hs.8895 ESTs 


4.1 


1.2 


106060 


NM-001329 


Hs.171391 Oterminal binding protein 2 


2.6 


7 


106070 


T74445 


Hs.5957 Homo sapiens done 2441 6 mRNA sequence 1 .4 


10.7 



4.8 



5.1 



14 



3.8 
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106083 H62087 Hs.31659 thyroid hormone receptor-associated prot 1.5 

106155 AA425414 Hs.33287 nuclear factor l/B 54 

106255 BE613206 Hs.279607 caJpastatin 1.8 

106414 BE568205 Hs.28827 mitogen-activated protein kinase kinase 5.1 

106538 AKQ00274 Hs.278635 HDCMA18P protein 1.2 

106568 AW051564 Hs.28285 patched related protein translocated in 1.8 

106574 BE044325 Hs.227280 U6 snRNA-assoclated Sm-Oke protein 2.3 

106613 N88604 Hs.30212 thyroid receptor interacting protein 15 \2 

106617 H09548 Hs.5367 ESTs, Weakly similar to 138022 hypotheti 0.9 

106619 AA459480 Hs.23956 hypothetical protein FU20502 1.3 

106701 BE387614 Hs.25797 splicing factor 3b,subunit 4, 49kD 1.6 

106721 AA741038 Hs.6670 ESTs 1.7 

106776 AA206079 Hs.6693 hypothetical protein FU20420 1 

106866 AA487416 Hs.268231 Homo sapiens cDNA: FU231 1 1 fis, clone L 1,6 

106868 BE185536 Hs.301 183 molecule possessing ankyrin repeats indu 3.3 

106887 BE503373 Hs.334335 hypothetical protein FU 13576 1.4 

106940 T85594 Hs.339808 hypothetical protein FU 101 20 3.3 

106968 AF216751 Hs.26813 COA14 3 

107052 BE391904 Hs.12482 gryceronephosphateOacyltransferase 1.7 

107061 BE147611 Hs.6354 stromal ceil derived factor receptor 1 1,2 

107149 AI289507 Hs.299883 hypothetical protein FLJ23399 1.8 

107222 BE172058 Hs.82689 tumor rejection antigen (gp96) 1 1.2 

107233 BE267795 Hs.22595 hypothetical protein FU10637 1.4 

107295 AA186629 Hs.80120 UDP-N-acetyt-aIpha^aIactosamine:polyp 2.6 

107679 AA011510 Hs.60512 ESTs 1.8 

107914 AA027229 Hs.61329 ESTs, Weakly similar to T16370 hypotheti 1.3 

107965 AF109219 Hs.108787 phosphatidyiinositol glycan, class N 1.6 
108033 AW368993 Hs.323748 Homo sapiens clone CDABP0086 mRNA sequen 

108060 AA291440 Hs.73149 paired box gene 8 1.1 

108081 M093668 Hs.28578 musdebfind (Drosophila)-fike 0.7 
108137 AI283611 Hs.263479 ESTs, Weakly similar to HMGLHUMAN HIGH 
108186 AW068579 Hs.7780 Homo sapiens mRNA; cDNA OKFZp564A072 (fr 

108215 AI879238 Hs.299315 collapsin response mediator proteto; C 1.5 

108297 AA333660 Hs.71331 hypothetical protein MGC5350 1.5 
108339 AW151340 Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALUS 

108371 AA074374 Hs.67639 ESTs 1,3 

108399 AFOB6070 Hs.237519 EST 1 

108469 AA079487 gbzm97fi)8.s1 Stratagene colon HT29 (937 1.5 

108470 AA079500 gb:zm96h10.s1 Stratagene colon HT29 (937 1.1 
108564 M23114 Hs.1526 ATPase, Ca++transporfing, cardiac muscl 2 
108641 AA112059 Hs.429 ATP synthase, H+ transporting, mitochond 1.1 
108668 AA058522 Hs. 185751 ESTs 12 
108694 AA036725 Hs.61847 ESTs 1.4 
108824 AK001332 Hs.44672 hypothetical protein FU 10470 1.4 
108863 AA133456 Hs.102548 glucocorticoid receptor DNA binding fact 1.2 
108893 BE276891 Hs.194691 refinoic acid induced 3 U 
108992 AA152312 Hs.72047 ESTs 1 1 
109072 AI732585 Hs.22394 hypothetical protein FU 10893 M 
109097 AA167512 gb:zp10f12.s1 Stratagene fetal retina 93 1.3 
109160 BE220601 Hs.301997 hypothetical protein FU 13033 4 
109244 BE179030 Hs.64239 Human DNA sequence from clone RP5-1 174N9 
109481 AA878923 Hs.289069 hypothetical protein FU21 01 6 3.8 
109484 AA366263 Hs.72531 hypothetical protein FU 11 838 1.9 
109795 AA173942 Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H 191 6 (f 
110009 BE075297 Hs.6614 ESTs. Weakly similar to A43932 mucin 2 p 4.6 
110107 AW151660 Hs.31444 ESTs 1 2 
110411 AW001579 Hs.9645 Homo sapiens mRNA for KIAA1 741 protein, 3.7 
110731 NMJH4899 Hs.188006 K1AA0878 protein 28 
110756 N21207 Hs.182999 ESTs 16 
110930 BE242691 Hs.14947 ESTs 31 
110935 AI753230 Hs.323562 hypothetical protein OKFZp564K142 1.9 
111051 A1681293 Hs.12186 hypothetical protein FU22558 2 
111110 AK001566 Hs.23618 hypofoetical protein aj10704 1.1 

111356 BE301871 Hs.4867 mannosyi(aIpha-13-)-glycoproteinbeta- 1 

111357 BE314949 Hs.87128 hypothetical protein FU23309 3.3 
111770 R27975 Hs.269401 ESTs, Moderately sImBar to S65657 alpha 1.2 
111900 AF131784 Hs^5318 Homo sapiens clone 25194 mRNA sequence 3.2 
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111903 


NM 014906 


Hs.166351 KIAA1072 protein 


1 


5.4 


111951 


NM~014927 


Hs 100527 KIAA0902 era tain 


1 


3.8 


112141 


AW137198 


Hs.278682 PhosphatidylglycerophQsphalfi Synthase 


1.4 


3.5 


.112193 


R49499 


Hs.138238 ESTs 


1.5 


3.6 


112197 




Hs.5637 ESTs 


4.6 


2 


112610 




Hs.23643 serine/threonine protein kinase MASK 


3.3 


10R 


112971 


242387 


Ms.83883 transmembrane, prostate androgen induced 


3.2 


3 


119QR4 


T1RQ71 

1 109/ 1 


Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 


3,7 


m r 


113056 


AF019226 
r\r\j ijt.£.\j 


Hs.8036 glioblastoma overexpressed 


4.5 


3.7 


113449 


AW1fiflfiR3 


Hs.158006 hypothetical protein 


1.2 


4.4 


1 IJ*t9f 


AP141191 


Hs.15572 hypoihetical protein IMAGE 109914 


0.9 


'K R 
0.0 


113508 


Al n49Q3fi 


Hs.21 1 57 1 holocytochrome c synthase (cytochrome c 


1.1 




113531 


AKnMAQ.fi 

AIMJU 10 JO 


Hs.16740 hypothetical protein FU11036 


1.2 


0.9 


113604 


AM7*t4n7 


Hs.296083 ESTs, Moderately similar to 154374 gene 


1.7 


j.j 


111R74 


MM M4914 


Hs.5753 inosltol(myo}-1{or4)-monophosphatase 2 


0.8 


R 1 

0. I 


111AA1 
1 toot I 


VVOUDO 1 


Hs.146233 Homo sapiens cDNA: FU22130 fis, clone H 


1.7 


fi 9 


113657 


MVwcHO I JO 


Hs.5297 DKF2P564A2416 protein . 


\2 


4fi 

H.O 


111031 




Hs.3496 hypothetical protein MGC15749 


1.5 


A 
1 


113936 


W17fWfi 

TV 1/ U3Q 


Hs.83623 nuclear receptor subfamily 1, group 1, m 


3.8 




4 1 10fl7 


AA34331;; 


Hs.964 1 complement component 1, q subcomponent, 


1.2 


A 7 


1141V 


AH494Q3 


Hs.24192 Homo sapiens cONA FU20767 fis, done CO 0.3 




11A15R 


RP17QAR9 


Hs.336920 glutathione peroxidase 3 (plasma) 


1.1 


4 1 

M.J 


114911 
114Z1J 


Nooouy 


Hs.19575 CGl-11 protein 


1.6 


q o 
y.£ 


1 I4DJD 


MrtU/3^00 


gb:zmB8d01.s1 Stratagene ovarian cancer ' 


1.6 


37 


1 WfOU 


AIQOOTflO 

Aiyzyjo^ 


Hs.252692 hypothetical protein RJ20343 


1.4 


A 


4 4 AIM 
1 1*1/01 


TinviAft 

1 11HWO 


Hs.95388 ESTs 


1 


*t.J 


4 ■M7QJ; 


AdUJ/(J3o 


Hs.173484 hypothetical protein FU10337 


1.6 


Q 9 


l Haul 


rtVOOUU l£ 


Hs.196437 hypothetical protein FU10788 


1.4 


R 9 


nouyo 


Aiooouoy 


Hs.175319 ESTs 


3.7 


1 


113310 




Hs.23240 Homo sapiens cDNA: RJ21848 fis, clone H 


3.2 


49 


1 13W0 


NJOllU 


Hs.305971 solute carrier family 2 (facilitated glu 


1.5 


j.y 


115764 
113/04 


awra99rr 

AWjoZZOO 


Hs.9101 1 anterior gradient 2 (Xenepus laevis) horn 


1.3 


o.y 


1 1 JOU£ 


AW4M933 

MVV4 


Hs.206521 YME1(S.cerevisiae)-Jike1 


1.7 


6.6 


11CQQ4 


nDviO' ooo 


Hs.109315 K1AA1415 protein 


1.5 


9.1 


1 ifini9 

1 lOUOi 


DC30J000 


Hs.42484 hypothetical protein FU10618 


0.9 


4.3 


116046 




Hs.94491 hypothetical protein FU20297 


1.6 


5.5 


116274 


AI19Q7K7 


Hs.182874 guanine nucleotide binding protein (G pr 


3.2 


2.4 


11fi1in 


79ARRA 
tZ*t03*t 


Hs.42299 ESTs 


0.8 


4.7 


116356 


Al 37 1993 


Hs.288671 Homo sapiens cDNA FU 11997 fis, done HE 2.4 


3*g 


11R49Q 




Hs.279923 putative nucleotide binding protein, est 


5.5 




1 1PMR1 


AAJ tJOUV 


Hs.58633 Homo sapiens cDNA: FLJ22145 fis, done H 


5 


1.3 


1 16470 


AI979141 


Hs.83484 SRY (sex determining region Y)-box 4 


8.7 


4.5 


I I03/0 


fV51 9R9 


Hs.75337 nucleolar and coiled-body phosphprotein 


3.2 


0.9 




A WRPR4 1 1 
A VVOOO't I 1 


Hs.81915 leukemia-associated phosphoprotein p18 ( 


3.2 


3 


1 16589 


Al33/£l<£ 


Hs.17132 ESTs, Moderately similar to 154374 gene 


3.1 


r ^ 

0.3 


110/00 


nzaooo 


Hs.301527 ESTs. Moderately similar to unknown [H.s 


3.2 


4 f% 
*1.0 


<M717n 

n/uu 




Hs.42500 AOP-ribosylation factor-Tike 5 


7 


3,0 


11/^10 


NAJUoo 


Hs.133207 PTPRF Interacting protein, binding prate 


1.2 


ft9 


4 179flft 


M1R917 


Hs.172129 Homo sapiens cDNA: FU21409 fis, done C 


4.5 


9 4 


117576 


A1383467 


Hs.44597 ESTs 


1.4 


4.2 


117667 


U59305 


Hs.44708 Ser-Thr protein kinase related to the my 


4.3 


0.5 


117681 


AF161470 


Hs.260622 butyrate-induced transcript 1 


2.1 


5.7 


118336 


BE327311 


Hs.47166 HT021 


3.6 


7.7 


118475 


N66845 


gb2a46c1 U1 Soares fetal fiver spleen 


4.2 


0.5 


118493 


AL353944 


Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 


3.5 


118505 


N67343 


gb:yz50b07^1 Morton Fetal Cochlea Homo 


2.1 


3.8 


119159 


AF142419 


Hs.15020 homotog of mouse quaking OKI (KH domain 3.7 


1.5 


119307 


BE048061 


Hs.37054 ephrin-A3 


3 


1.1 


119355 


BE218319 


Hs.5807 GTPaseRab14 


1.1 


5.6 


119462 


BE041667 


Hs.314544 Homo sapiens cervical cancer suppressor- 


1.4 


4.3 


119771 


AI905687 


Hs.2533 EST 


3.2 


1 


119940 


AL050097 


Hs.272531 DKFZP586B0319 protein' 


4.3 


0.7 


119943 


BE565849 


Hs.14158 copinelll 


3.5 


1.9 


120407 


AA235207 


• Hs.250456 hypothetical protein DKFZp762F2011 


1.5 


3.7 


120493 


AW968080 


Hs.152939 Homo sapiens done 24630 mRNA sequence 4 


1.4 


120677 


AF1502O8 


Hs.108327 damage-spedfic DNA binding protein 1 (1 


1.6 


6.8 


120867 


AA350781 


Hs.96967 ESTs 


1.1 


3.6 



3.3 
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121368 BE262956 
121603 AA416785 
121723 AA243499 
122223 AF169797 
122378 AB032948 
122946 A1718702 
123155 AF121856 
123158 AF161426 
123327 AA421581 
123495 W28673 
123526 AA608657 
123533 AA608751 
123768 AI932318 
123961 AL050184 

123999 AF084555 

124000 BE563957 
124038 AB037860 
124059 BE387335 
124083 AW195237 
124148 BE30O094 
124153 AU077333 
124252 BE613340 
124314 AK001552 
124375 D87454 
124432 N39016 
124447 N480Q0 
124539 D54120 
124543 AI393320 
124564 H66409 
124574 AL036596 
124605 AA749315 
124639 H60193 
124659 AI680737 
124737 BE270465 
124760 AW408586 
124763 BE410405 
124792 R44357 
124842 R56485 
124940 AF068846 
124949 AI903210 
124960 AL023513 
124995 T52700 
125030 AA610577 
125034 BE548446 
125058 T83731 
125076 AA973971 
125090 T91518 
125103 AA570056 

125154 W38419 

125155 AA837043 
125159 AK000669 
125170 AL020996 
125181 R40815 
125193 W67577 
125260 H05635 
125262 AW884980 
125272 BE612888 
125388 W27235 
125824 Z45258 
125852 AW630088 
125970 AW504721 
126192 AW160399 
126469 BE384361 
126510 AA057593 
127095 AA340277 
127524 AI243596 
128312 J04182 



Hs.178292 

Hs.249495 

Hs.104800 

Hs.27413 

Hs.21355 

Hs.308026 

Hs.284291 

Hs.218329 

Hs.178443 

Hs.106747 



Hs.188762 

Hs.21610 

Hs.7351 

Hs.74861 

Hs.173933 

Hs.283713 

Hs.7734 

Hs.227751 

Hs.160483 

Hs.334725 

Hs.215766 

Hs.192966 

Hs.268869 

Hs.146409 
Hs.104573 
Hs.108275 
Hs.42322 
Hs.77171 
Hs.21143 
Hs.289068 
Hs.78793 
Hs.91052 
Hs.76288 
Hs.48712 

Hs.103804 
Hs.336780 
Hs.194766 
Hs.1 10044 
Hs.187775 
Hs.5167 
Hs.3343 



KIAA0180 protein 1,5 
heterogeneous nudear ribonucleoprotein 2.2 
hypoflieiical protein FLJ10134 3.4 
adaptor protein containing pH domain, PT 3.9 
hypothetical protein DKFZp762K2015 1.4 
major histocompatibility complex class 1 .4 
sorting nexin 6 1.2 
hypothetical protein Z4 
ESTs 0.9 
serine carboxypeptidase 1 precursor prot 1 .3 
gbae55d04.s1 Stratagene lung carcinoma 2.1 
gb:ae56h07.s1 Stratagene lung carcinoma 2.1 
ESTs. Moderately similar to H2BL.HUMAN H 1.1 
DKFZP434B203 protein 1.1 
cyclic AMP phosphoprotein, 19 kD 1.4 
acQvated RNA polymerase II transcriptio 1 .9 
nuclear factor I/A 1.5 
ESTs, Weakly similar to S64054 hypolheti 14.8 
hypothetical protein FU22174 1.2 
lectin, galactoside4)lnding. soluble, 1 25 
erythrocyte membrane protein band 7.2 (s 1 
Homo sapiens, Similar to RIKEN cONA 9430 1.5 
GTP-binding protein 1.8 
KIAA0265 protein 1.1 
ESTs, Weakly similar to ALUC_HUMAN If Jl 1.3 
gb:yy98e12.s1 Soares_mulllp[e_sderosis_ 2.7 
cell division cycle 42 (GTP-binding prot 2.1 
ESTs 1 
ESTs 1.4 
A kinase (PRKA) anchor protein 2 0.7 
minichrornosome maintenance deficient (S. 11 
OKFZP586C1324 protein 1.4 
Homo sapiens cDNA FU1 1918 fis, done HE 1.5 
protein kinase C, zeta 0.7 
ESTs, Moderately similar to ALU5.HUMAN A 0,9 

1.3 
1.8 
1 

3.2 
1 

0.9 
0.9 
U 



calpain 2, (m/li) large subun'rt 
hypothefical protein FU20736 
gb.yg93h09.s1 Scares Infant brain 1NI8 H 
heterogeneous nudear ribonucleoprotein 
tubufin, beta polypeptide 
seizure related gene 6 (mouse)-like 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434F152 (fr 

phosphogrycerate dehydrogenase 0.9 
gbBq02hQ8.s1 NCI_CGAP_Lu5 Homo sapiens 

gbye20f05,s1 Stratagene lung (937210) H 3.2 

Hs.122730 ESTs.ModeraterysimilartoKlAA1215pro 5.3 

gb:zc78a07.s1 Pancreatic Islet Homo sapi 0.9 

Hs.143669 ESTs 1.1 

Hs.274428 TRF2-inleractlng telomeric RAP1 protein 1.1 

Hs.8518 selenoprotein N 1.1 

Hs.12396 ESTs, Weakly similar to 2004399A chromos 1 

Hs.84298 CD74 antigen (invariant polypeptide of m 1.2 

Hs.294030 topolsomerase-related function protein 4 1 

Hs.171957 triple functional domain (PTPRF interact 1.3 

Hs.1 80224 myosin regulatory light chain 1.1 

Hs.6431 1 a dlsintegrin and metalloproteinase doma 1 .4 

Hs.286013 short coiied-coil protein 2.4 
Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 

Hs.1 77516 high density lipoprotein binding protein 1.9 

Hs.30376 hypothetical protein 1.4 

Hs.182885 ESTs, Weakly similar to JC5024 UDP-galac 2 

Hs.334762 hypothetical protein RJ 14735 1.3 

Hs.10248 Homo sapiens cONA FU20167 fis, done CO 1.3 

Hs.94830 ESTs, Moderately similar to T03094 A-kln 4.3 

Hs.150101 lysosomal 1.5 



4.1 

5.5 

3.2 

3.9 

7.1 

3.7 

4.9 

3.6 

5.2 

5.1 

5.2 

9.3 

3.6 

3.5 

3.8 

11.2 

4.4 

11.5 

6.2 

12.7 

4.1 

8.4 

10.2 

4.8 

4.1 

4.3 

5.7 

4.1 

4 

4 

3.5 
3.6 
9.9 
4 

3.6 
3.9 
4.2 
3.6 
3.4 
4.4 
5.2 
3.5 
5 

1.5 
6 

1 3.7 

2.5 

6.6 

6.1 

4.3 

4.1 

3.8 

3.6 

7.8 

4.9 

4.8 

16.1 

5.3 

8.7 

1.8 

3.8 

4.1 

3.7 

4.1 

5 

0.9 
4.7 
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128453 X02761 


Hs.287820 fibronedin 1 


1.2 


4.3 


128460 T16206 


Hs.237164 ESTs, Highly similar to LDHH.HUMAN L-LAC 3.1 


44.4 


128491 H08379 


Hs.165563 hypothetical protein DKFZp434N1429 


0.6 


13.1 


128495 NM 005904 


Hs.100602 MAD (mothers against decapentapieglc, Dr 


1.3 


4 


128546 NMJJ03478 


Hs.101299 cullin 5 


1 


5.1 


128574 AI185977 


Hs.38260 ubiquitin specific protease 1 8 


0.8 


4 


128611 NMJJ14721 


Hs.102471 K1AA0680 gene product 


1.3 


3,7 


128652 AA432202 


Hs.103147 hypothetical protein FU21347 


1.4 


3.9 


128653 D87432 


Hs. 103 15 sohte carrier fam3y 7 (cationlc amino 


1.2 


3.6 


128655 A1246669 


Hs.324275 WW domain-containing protein 1 


0.8 


4.1 


128684 BE246444 


Hs.283685 hypothetical protein FLJ20396 


3 


1.6 


128717 AK001564 


Hs.104222 hypothetical protein FU10702 


2.8 


4.8 


128774 AA476220 


Hs.54457 CD81 antigen (target of antiproBferativ 


1.1 


10.6 


128790 AF026692 


Hs.105700 secreted frizzled-related protein 4 


1 


3,8 


128805 AA194554 


Hs.1 83434 ATPase, H+ transporting, lysosomal (vacu 


5.3 


5.3 


128827 AI638184 


Hs.106334 Homo sapiens done 23836 mRNA sequence 2.2 


5,3 


128840 AJ917602 


Hs.106440 ESTs 


1 


4.5 


128869 AA768242 


Hs.8061B hypothetical protein 


0.8 


3,6 


128889 060985 


Hs.106909 DKFZP566D193 protein 


4.6 


3.7 


128890 AI222020 


Hs.1 82364 CocoaCrisp 


3 


1.5 


128915 AK000140 


Hs.107139 hypothetical protein 


0.2 


3.9 


128920 AA622037 


Hs.1 66468 programmed ceO death 5 


15 


15.2 


128926 AF1 55096 


Hs.107213 hypothetical protein FU20585 


4 


4 


128930 AA298958 


Hs.10724 MDS023 protein 


1.2 


4.5 


128942 AW247536 


Hs.10729 hypothetical protein 


14 


5 


128948 AW953622 


Hs.223025 RAB31, member RAS oncogene family 


2.3 


5.6 


128953 AB020716 


Hs.107362 KIAA0909 protein 


0.9 


3.9 


128979 AW271217 


Hs.281434 Homo sapiens cDNA FLJ14028 fis, done HE 1,5 


3.6 


128980 AA258924 


Hs.10758 NMJ)02495*.Homo sapiens NADH dehydrogena 


0.8 


129005 AI770025 


Hs.13323 hypothetical protein FU22059 


1.2 


5.7 


129009 C15105 


Hs,330716 Homo sapiens cDNA FLJ14368 fis, done HE 2.1 


9.9 


129013 AA371156 


Hs.107942 DKFZP564M1 12 protein 


2.4 


3.8 


129068 AI634522 


Hs.152925 KIAA1268 protein 


1.2 


3.8 


129106 AW504486 


Hs.108689 sterol regulatory element binding transc 


1.2 


5.5 


129113 BE543205 


Hs.288771 DKFZP586A0522 protein 


0.5 


3.7 


129125 AB002450 


Hs.278391 CGI-109 protein 


1 


5.2 


129126 AW881089 


Hs.108806 Homo sapiens mRNA; cDNA DKFZp566M0947 (f 


1.5 


129151 N23018 


Hs.1 71 391 C-terminal binding protein 2 


2.1 


9.7 


129230 AA335362 


Hs.109646 Empirically selected from AFFX single pr 


0.9 


8.6 


129234 M16916 


Hs.282997 glucosidase, beta; acid (Indudesglucos 


1.1 


3.5 


129238 BE542214 


Hs.109697 ESTs 


1.1 


118 


129239 W57656 


Hs.1 09701 ubiquitin-Hke 5 


3.2 


5.1 


129241 AI878857 


Hs.1 09706 hematological and neurological expressed 


1.9 


5,7 


129243 BE169531 


Hs.109727 TAK1-bindlng protein 2; KIAA0733 protein 


1.2 


6.6 


129247 R49920 


Hs.109733 CGM31 protein 


1.5 


3.5 


129250 AA344367 


Hs.109760 Empirically selected from multiple AFFX 


1 


5.4 


129258 AA250970 


Hs.251946 poly{A)-bindlng protein, cytoplasmic 1-1 


1.3 


4.1 


129260 AF077200 


Hs.279813 hypothetical protein 


1.6 


3.9 


129270 AA357185 


Hs.109918 ras homolog gene family, member H 


1.8 


4l2 


129277 AB007896 


Hs.1 10 putative L-type neutral amino add trans 


1.1 


6.1 


129284 AA318224 


Hs.296141 ESTs ■ 


15 


4.8 


129300 W94197 


Hs.1 10165 ribosomal protein L26 homolog 


1.6 


5.1 


129318 AF189062 


Hs.285976 tumor metastasis-suppressor 


1.8 


6.5 


129352 AW511656 


Hs.170177 Meisl (mouse) homolog 


0.9 


4 


129362 U30246 


Hs.1 10736 solute carrier family 1 2 (sodium/potassi 


1.4 


9.2 


129379 BE278964 


Hs.1 1085 CGM 11 protein 


1 


4.8 


129390 AA318271 


Hs.250905 hypothetical protein 


1 


4.1 


129416 AA016188 


Hs.1 11 244 hypothetical protein 


1.8 


10.7 


129427 AI498631 


Hs.111334 ferritin, light polypeptide 


1.1 


4.8 


129470 W92931 


Hs.250899 heat shock factor binding protein 1 


1,8 


9.3 


129472 AL050260 


Hs.323817 DKFZP547E1010 protein 


1 


5 


129475 NM.004477 


Hs.203772 FSHD region gene 1 


1.1 


4.2 


129498 AA449789 


Hs.7551 1 connective tissue growth factor 


1.9 


6.8 


129501 A1631811 


Hs.180403 STRIN protein 


1.1 


9.7 


129527 AA769221 


Hs.270847 delta-tubulin 


1.1 


4.3 


129545 R18087 


Hs.323769 dsplatin resistance related protein CRR 


1 


4.2 


129579 AW517695 


Hs.286218 junctional adhesion molecule 1 


2.3 


3.5 



3.8 
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129606 AW968941 Hs.166254 hypothetical protein DKFZp566M 33 2.4 

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.2 

129620 D79338 Hs.239720 CCR4-NOT transcription complex, subunit 1.6 

129621 AL1 10212 Hs.301005 purine-rlch element binding protein B 1 .1 
129634 AB020335 Hs. 181 300 seM (suppressor of lin-1 2. C.elegans)- 0.9 
129663 A1207406 Hs.11866 translocase of inner mitochondrial membr 1.9 
129679 AW889132 Hs.11916 ribokinase 0.9 
129688 U53209 Hs.24937 transformer-2 alpha (htra-2 alpha) 1.3 
129691 M26939 Hs.1 19571 collagen, type 111, alpha 1 (Ehlers-Oani 4.7 
129712 U46386 Hs.W02 sorting nex&i 3 U 
129747 AL050272 Hs.12305 0KFZP566B 183 protein 1 
129788 BE397454 Hs.124969 Homo sapiens done 24707 mRNA sequence 1.4 

129796 8E218319 Hs.5807 GTPaseRab14 2.9 

129797 M62839 Hs.1252 apoDpoprotein H (beta-2-glycoprotein I) 0.3 
129800 AF052112 Hs.12540 lysosomal 1.6 
129834 AL080084 Hs.296155 CGU0O protein 0.9 
129836 AW410233 Hs.206521 YME1 (S.cerevisiaeHike 1 1.8 
129843 NM_014840 Hs.200598 WAA0537 gene product 0.9 
129874 AA626937 Hs.181551 hypothetical protein MGC2594 1.4 
129878 Z43161 Hs.283714 30 kDa protein 1.1 
129904 AL1 19499 Hs.13285 neuronal potassium channel alpha subunit 1 
129917 M30773 Hs.278540 proletn phosphatase 3 (formerly 28), reg 2 
129976 X14008 Hs.234734 lysosomal 0.9 
129982 214221 gb:Rsapiens germfine transcript of lg h 1.2 
130007 R15917 Hs.142570 Homo sapiens done 24629 mRNA sequence 4.3 
130060 BE277024 Hs.146381 RNA binding motif protein, X chromosome 1.6 
130064 X57815.comp Empirically selected from AFFX single pr \2 
130068 M93143 Hs.262869 plasminog en-like 1.4 
130090 H97878 Hs. 132390 zinc finger protein 36 (KOX 18) 1.4 
130095 AK001635 Hs.14838 hypothetical protein FU 10773 0.2 
130102 W61O05 Hs.14896 DHHC1 protein 1 . 
130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( U 
130115 T47294 Hs. 1 49923 X-box binding protein 1 3.8 
130123 NM_005095 Hs. 150390 zinc finger protein 262 1 
130150 BE094848 Hs.15113 homogentisate 1,2-dioxygenase (homogenti 0.5 
130161 R42676 Hs.151385 WAA0564 protein 1 
130210 M23115 Hs.1526 ATPase. Ca^ transporting, cardiac musd 0.4 
130213 BE278370 Hs.15265 heterogeneous nuclear ribonudeoprotein 17 
130215 BE3018B3 Hs. 152707 glioblastoma amplified sequence 1 
130232 U29463 gfcHuman cytochrome b561 gen 1.2 
130252 U92014 Hs.153527 Homo sapiens pTM5 mariner-like transposo 1.3 
130281 W78907 Hs.15395 similar to arginyl-tRNA synthetase (argi 1.5 
130343 AB040914 Hs.278628 K1AA1481 protein 2,9 
130385 AW067800 Hs. 155223 stanntocalcin 2 3.2 
130414 AW842182 Hs.241392 small Inducible cytokine A5 (RANTES) 1.4 
130417 AW163518 Hs.155485 huntingfln interacGng protein 2 1.7 
130440 AA852B68 Hs. 1 32853 K1AA0 171 gene produd 1.1 
130442 NM_006245 Hs. 11 8244 protein phosphatase 2, regulatory subuni 1.4 
130465 AW362955 Hs.15641 Homo sapiens cDNA aj 144 15 fis, done HE 1.6 
130479 R44163 Hs.12457 hypothetical protein FLJ 108 14 0.9 
130499 AB007915 Hs.158286 KJAA0446 gene produd 1 
130546 A1598022 Hs.193989 TAR DNA binding protein 1.3 
130568 AA232119 Hs.16085 putative G-proteln coupled receptor 12 
130606 A1652143 Hs.288382 hypothetical protein FLJ1 31 1 1 1 
130612 BE242873 Hs,16677 WD repeat domain 15 1.1 
130616 AL049963 Hs.284205 up-iegulated by BCG-CWS 0.6 
130623 AL045128 Hs.1691 grucan (1,4^lpha-), branching enzyme 1 0.9 
130629 AL042896 Hs.1697 ATPase, H+ transporting, lysosomal (vacu 0.9 
130632 AW073971 Hs.238954 ESTs, Weakly similar to K1AA1 204 protein 0.9 
130639 AI557212 Hs.17132 ESTs, Moderately similar to I54374 gene 2.6 
130641 AF158555 Hs.239189 glutaminase 1.2 
130653 A1861791 Hs.278479 TSPY-like 1.3 
130655 AIB31962 Hs.17409 cysteine-rich protein 1 (intestinal) 2.5 
130666 AL117508 Hs.194035 KIAA0737 gene produd ' 1.3 
130669 AI928985 Hs.17680 hypoflietical protein MGC1314 similar to 1.4 

130693 R68537 Hs.17962 ESTs 3.2 

130694 NMJ)14827 Hs.17969 KIAA0663 gene produd 1.1 
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130696 
130701 
130707 
130731 
130787 
130796 
130808 
130863 
130902 
130908 
130911 
130913 
130923 
130959 
130967 
130975 
131037 
131039 
131060 
131097 
131101 
131104 
131107 
131109 
131136 
131148 
131150 
131156 
131164 
131181 
131194 
131199 
131215 
131216 
131233 
131237 
131262 
131263 
131367 
131372 
131373 
131388 
131492 
131493 
131514 
131524 
131528 
131534 
131555 
131578 
131589 
131609 
131626 
131670 
131697 
131701 
131703 
131739 
131764 
131781 
131791 
131853 
131870 
131903 
131913 
131930 
131941 



AA325308 

Z98883 

AW190925 

AI932971 

AF072813 

AA088809 

NMJ01761 

Y10805 

AB037750 

AW195747 

BE409769 

BE390905 

H96115 

AB023182 

AA393071 

AA099923 

BE243101 

D87436 

AA194422 

AL137682 

BE387561 

W27770 

BE620886 

BE564123 

AB033099 

AW953575 

X77753 

AI472209 

AW013807 

H25094 

AW864222 

AW979155 

AL050107 

AI815486 

089053 

AW956868 

AU077158 

AU077002 

AI750575 

AW293399 

NM_006052 

NMJJ14810 

AI452601 

AW960146 

BE270734 

AB040927 

AU076408 

AF157326 

T47364 

AA936296 

C18825 

D83032 

BE514605 

H03514 

C19034 

AF103798 

AW160865 

AF017986 

A1805664 

AF077036 

X62111 

AI681917 

NMJ14874 

NMJ0O4642 

AW207440 

AA772603 



Hs.18016 

Hs.18079 

Hs.203559 

Hs.18593 

Hs.252831 

Hs.19525 

Hs.1973 

Hs.20521 

Hs.21061 

Hs.21122 

Hs.21189 

Hs.21198 

Hs.21293 

Hs.184523 

Hs.182579 

Hs.283728 

Hs.22391 

Hs.166318 

Hs.22564 

Hs.22937 

Hs.22981 

Hs.301756 

Hs.75354 

Hs.23060 

Hs.23413 

Hs.303125 

Hs.23582 

Hs.323117 

Hs,182265 

Hs.293663 

Hs.24083 

Hs.298275 

Hs^4341 

Hs.243901 

Hs.268012 

Hs.24608 

Hs.24930 

Hs.24950 

Hs.173933 

Hs.144904 

Hs.26146 

Hs.92200 

Hs.288869 

Hs.284137 

Hs.2795 

Hs.301804 

Hs.28309 

Hs.184786 

Hs.278613 

Hs.234265 

Hs.29191 

Hs.169984 

Hs.289092 

Hs.10130 

Hs.288613 

Hs.30819 

Hs.30888 

Hs.31386 

Hs.31731 

Hs.31989 

Hs.3321 

Hs.3363 

Hs.3436 

Hs. 185973 

Hs.69476 

Hs.35086 



Homo sapiens mRNA; cONA DKFZp586H0324 (f 1.8 

phosphatldyfinositol glycan, class Q 1.1 6.7 

hypothetical protein FU12701 1.2 4.1 

HomosapienscONA:FU21449fjs, clone C 1.4 6.9 

reticu!on3 1.2 11.2 

hypothetical protein RJ22794 1.8 6.8 

cydinF 1.3 4.1 

HMT1 (hnRNP methyltransf erase, S. cerevl 3.2 • 5.9 

KIAA1329 protein 1 .3.8 

hypothetical protein FU1 1830 similar to 1.3 7.9 

DnaJ {Hsp40} homolog, subfamily A, membe 2.7 3.7 

translocase of outer mitochondrial membr 1 .9 4 

UDP4Jnacteylgtucosarninepyrophosphorylas 1.9 10.3 

KIAA0965 protein 1.5 6.8 

leucine aminopeptidase 1.4 5.5 

PEST-contalnlng nudear protein 1.3 3.8 

chromosome 20open reading frams 3 1.9 4.1 

Bpin2 1.6 3.5' 

myosin VI 4.5 5 

l-kappa-8-interacting Ras-iike protein 2 2 3.7 

OKFZP586M1523 protein 1.6 • 4.5 

ESTs, Weakly similar to T31475 hypotheti 0.9 3.5 

6CN1 (general control of amino-add synt 2.1 4.5 

DKFZP564F0522 protein 1.1 4.6 

KIAA1273 protein 1.2 4.2 

p53-induced protein P1GPC1 4.5 135 

tumor-associated calcium signal transduc 3.4 0.4 

ESTs 0.8 4.9 

keratin 19 3.3 2.4 

ESTs, Moderately similar to I38022 hypot 0.6 4 

KIAA0997 protein 1.4 3.8 

amino acid transporter 2 1 .2 8.5 

transcriptional co-activator with PDZ-bi 0.7 4.7 

Homo sapiens cDNA FU20738 fis, done HE 2.1 8.2 

fatty-add-Coenzyme A figase, long-chain 1 .7 3.5 

DKFZP5640177 protein 1.3 5.4 

tubulin-speciflc chaperone a 1.6 4.8 

regulator of G-proteln signalling 5 1.4 4.4 

nuclear factor l/A 3,3 . 12 

nudear receptor co-repressor1 1.6 3.9 

Down syndrome critical region gene 3 1 111 

KIAA0480 gene product 5 2 

nudear receptor subfamily 2, group F, m 0.9 3.5 

hypothetical protein FU 1 2888 1 3.5 

lactate dehydrogenase A 2 6.5 

KIAA1494 protein 1.5 10.7 

UDP-glucose dehydrogenase 1.3 4.7 

TBP-interacting protein 1.3 4.9 

interferon, alpha-inducibte protein 27 1 .5 8 

DKFZP586G011 protein 1.8 3.5 

epithelial membrane protein 2 1.3 8.2 

nuclear protein 2.8 3.9 

Homo sapiens cDNA: FU22380 fis. clone H 1.3 11.2 

ESTs 1.3 4.8 

Homo sapiens cDNA FU14175 fis, clone NT 3.2 9.7 

hypothetical protein 1.3 5.2 

cytochrome c oxidase subunit Vila polype 1.3 7.8 

secreted frlzzled-related protein 2 1 0.6 14.7 

peroxiredoxin 5 1.1 3.6 

OKF2P586G1722 protein 1.6 3.7 

gbtl^apiens VII-5 gene for immunoglobul 1 .1 3.5 

ESTs, Highly similar to IRX1_HUMAN IROQU 5.3 1.2 

KIAA0214 gene product 0.6 

deleted in oral cancer (mouse, homolog) 2.4 4.9 

degenerative spermatocyte (homolog Droso 2.4 6 

Homo sapiens cDNA FU12758 fts, done NT 1.7 9.2 

ubiquitin specific protease 1 0.5 5.2 
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131947 AI123939 Hs.182997 ESTs OJ 

131961 AA129782 Hs.3576 Homo sapiens mRNA full length Insert cON 0.9 

131964 AW381148 Hs.198365 2,34)isphosphog!ycerate mutase 1.1 

131974 AF208856 Hs.268122 hypothetical protein U 

131983 AF119665 Hs.184011 pyrophosphatase (Inorganic) 3.3 

131997 AF229181 Hs.136644 CS box-containing WO protein 0.9 

132006 AW162336 Hs.3709 towirolecularn^ubiquinone-bindingpr \2 

132063 BE277910 Hs.3833 3'-phosphoadenosine ff-phosphosulfete sy 3.2 
132065 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu M 

132071 AF217798 Hs.3850 US 1 -interacting protein NUDEL; endoolta 0.7 

132079 AI701457 Hs.38694 ESTs 2 

132094 NM.016045 Hs.3945 CGM07 protein 1.2 

132116 AW960474 Hs.40289 ESTs 3.1 

132164 AI752235 Hs.41270 procoliagen-lysine, 2-oxoglutarate 5-dio 1.8 
132181 AW961231 Hs.16773 Homo sapiens done TCCCIA00427 mRNA sequ 

132208 AL031709 Hs.241575 hypothetical protein CAB561 84 1.4 

132258 AA306325 Hs.431 1 SUMO-1 activating enzyme subunit 2 2 
132303 BE177330 Hs.325093 Homo sapiens cONA: RJ21 210 fis, clone C 1.2 

132316 U28831 Hs.44566 KIAA1 641 protein 5 9 

132358 NM_003542 Hs.46423 H4 histone family, member G 5,8 

132384 AA312135 Hs.46967 HSPC034 protein 2.1 

132397 AA021160 Hs.4750 hypothetical protein DKFZp564K0822 1.3 

132413 AW361383 Hs.260116 metalloprotease 1 (pitriiysin family) 2 

132442 AW970859 Hs.313503 ESTs 1.2 

132534 BE388673 Hs.5086 hypofiietical proten MGC10433 2 

132540 BE396290 Hs.5097 synaptogyrin2 1.4 

132554 AF065391 Hs.194718 zinc finger protein 265 1.2 

132575 AV660538 Hs.2841 62 60S ribosomal protein L30 Isolog 3 

132585 AFO29750 Hs.179600 TAP binding protein (tapasin) 1.8 

132602 AW606927 Hs.5306 hypothetical protein DKF2p586F1 122 simO 1.6 

132608 AA353044 Hs.5321 ARP3 (actln-related protein 3, yeast) ho 1.8 

132718 NM_004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 4.2 

132719 AI264357 Hs.55405 hypothetical protein MGC16212 1.1 
132730 AK000868 Hs.5570 hypothetical protein FU 10006 1.4 
132765 BE222975 Hs.56205 insulin induced gene 1 11 
132782 F07424 Hs.279840 zinc finger protein 222 1.3 
132793 AB020713 Hs.56966 KIAA0906 protein 2.3 
132805 AW975748 Hs.5724 sclerostin 07 
132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 1.8 
132894 D63209 Hs.5944 solute carrier family 11 (proton-coupled 15 
132930 AA579258 Hs.6083 Homo sapiens cONA: FU21028 fis, clone C 1 

132932 AW118826 Hs.6093 Homo sapiens cONA: FU22783 fis, clone K 0.7 

132933 BE263252 Hs.6101 hypothetical protein MGC3 178 1.6 
132965 AI248173 Hs.191460 hypothetical protein MGC12936 1 
132984 BE539199 Hs.62112 zinc finger protein 207 1.5 
132990 X77343 Hs.334334 transcription factor AP-2 alpha (actival 13.9 
132998 Y00062 Hs. 170121 protein tyrosine phosphatase, receptor t 0.6 
133002 AW499985 Hs.42915 ARP2 (actiivrelated protein 2, yeast) ho 1.5 

133011 NMJJ06379 Hs.171921 sema domain, Immunoglobulin domain (Ig), 3.5 

133012 AA847843 Hs.62711 Homo sapiens, done IMAGE:3351295, mRNA 1 
133040 AW502761 Hs.30909 KIAA0430 gene produd ' 0.9 
133056 H12028 Hs.6396 jumping translocation breakpoint 1.7 
133063 AI654133 Hs.30212 thyroid receptor interacting protein 15 0.6 
133067 AK000708 Hs.169764 hypothetical protein FU20701 \2 
133080 AF089816 Hs.6454 chromosome 19 open reading frame 3 1.2 
133110 AA808177 Hs.65228 ESTs 0.9 
133150 AV655783 Hs.661 Empirically selected from AFFX single pr 1.1 
133175 AW955632 Hs.66666 ESTs, Weakly similar to S19560 proline-r 1.5 
133199 AF231981 Hs. 2501 75 homolog of yeast long chain polyunsatura 5.5 
133203 AA464362 Hs.6748 hypothetical protein PP1665 1.2 
133206 AB037773 Hs.6762 hypothetical protein 16 

W %1 ^l 474 Hs 301746 RAP^^emberofRAS oncogene family 2.4 

133229 AL137480 Hs.6834 K1AA1 01 4 protein 1 

133241 AW796524 Hs.68644 Homo sapiens microsomal signal peptidase 1.3 

133257 BE617892 Hs.6895 actin related protein 2/3 complex, subun 1.4 

133271 Z48633 Hs.283742 rlsaptans mRNA for retrotransposon 31 

133273 N27672 Hs.69469 dendritic ceH protein is 
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133287 AW797437 


Hs.69771 B-factor, properdin 


1.3 


4 


133291 BE297855 


Hs.69855 NRAS-re!atedgene 


1.4 


5 


133292 AA304961 


Hs.699 peptidylprolyl tsomerase B (cyclophllin 


2.2 


6.8 


133294 AJ001388 


Hs.69997 zinc finger protein 238 


1.5 




133300 AF1 16666 


Hs.70333 hypothetical protein MGC10753 


1.4 


63 


133302 XQ4898 


Hs.237658 apolipoprotein A-ll 


0.2 


O.O 


133308 U56979 


Hs.250651 H factor 1 (complement) 


0.6 


5 


133347 BE257758 


Hs.71475 add cluster protein 33 


1.2 


A 9 




Hs.72157 OKFZP564I1922 protein 


3.7 


<s ft 
3.0 


133404 AB00791R 


. Hs.214646 K1AA0447 gene product 


1.4 


5 1 
O. 1 


133408 AI738719 

fvinuu nil owl 1 3 


Hs.198427 hexokinase2 


0.9 


0.0 


133422 ABQ3306.1 


HsJ3287 WAA1235 protein 


1.2 


1 7 
o.f 


133442 AL1 37663 


Hs.7378 Homo sapiens mRNA; cDNA DKFZp434G227 (fr 


0.7 


133448 M27749 


Hs.288168 immunoglobulin lambda-like polypeptide 1 


1.1 


4.3 


133449 AF038962 


Hs.738 1 voltage-dependent anion channel 3 


0.7 


4 2 


133501 AI962602 


Hs.74284 hypothetical protein MGC2714 


3.1 


5.9 


133504 NMJ304415 


Hs.74316 desmoplakin(DPI i DPIl) 


4.3 


1 I*? 


133506 BE562958 


Hs.74346 hypothetical protein MGC14353 


18 


1Q7 
13./ 


133532 D87452 


Hs.74579 KIAA0263 gene product 


1.2 


5 A 


133574 H97991 


Hs. 1933 13 Empirically selected from AFFX single pr 


1.4 


3.9 


133586 AJ929645 


Hs.225936 synapstnl 


0.8 


4.3 


133589 U7368 


Hs.75104 RNA-binding protein S1, serine-rich doma 


2 


in c 


133591 AI423369 


Hs.751 1 1 protease, serine, 11 (IGF binding) 


2.1 




133606 U10564 


Hs.75188 weeH(S.pcmbe)homolog 


3.3 


1.1 


133617 BE244334 


Hs.75249 ADP-ribosylation factor-like 6 interact! 


2.3 


5.6 


133651 AI301740 


Hs. 1 73381 dihydropyrimldinase-like 2 


0.8 


11 £ 

10.0 


133660 H14843 


Hs.303154 popeye protein 3 


1 ' 


Q 1 
3. 1 


133663 AJ006239 


Hs.75436 quinokJ dihydropteridine reductase 


0.5 


D.0 


133668 L77964 


Hs.271980 mitogen-activaled protein kinase 6 


1.1 


6 9 


133671 AW503116 


Hs.301819 zinc finger protein 146 


1.8 


^ A 
0.0 


133681 AI352558 


Hs .75544 tyrosine 3-monooxygenase/tryptophan 5-mo 


1.5 


11 1 

1 I.I 


133694 W17187.comp Hs.232400 heterogeneous nuclear ribonucleoprotein 


2 


J.3 


133708 AI018666 , 


Hs.75667 synaptophysin 


0.6 


0.0 


133737 AW001130 


Hs.75824 KIAA0174 gene product 


1.2 


7.2 


133743 AI929587 


Hs.75847 CREBBP£P300 inhibitory protein 1 


1.5 


c 

0 


133750 BE410769 


Hs.75873 zyxin 


\2 


A A 


133765 M62194 


Hs.75929 cadherin 1 1, type 2, OB-cadherin (osteon 


3.2 


4.1 


133776 BE268649 


Hs. 177766 ADP-ribosyltransferase (NAD+; poly (ADP- 


2.1 


0.0 


133799 W24087 


Hs.76285 DKFZP564B167 protein 


1.9 


19 fi 


133800 AF075337 


Hs.76293 thymosin, beta 10 


2.6 


O.D 


133802 AW239400 


Hs.76297 G protein-coupled receptor kinase 6 


1 


4.9 


133806 025969 


Hs.76325 step II splicing factor SLU7 


0.5 


1 A 
0.0 


133817 AW578716 


Hs.7644 H1 histone family, member 2 


1.5 


*I.D 


133829 AW630088 


Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 


1 7 
O./ 


133841 AA345824 


Hs.76688 carboxylesterase 1 (monocyte/macrophage 


0.3 


A A 


133845 AA147026 


Hs.76704 ESTs 


5.5 


9 Q 

t.3 


133863 AI815523 


Hs.76930 synuclein, alpha (non A4 component of am 


0.6 


A A 
4.0 


133887 X07767 


Hs.77271 protein kinase, cAW-dependent, cataiytl 


1 


it\ 9 


133892 AW859528 


Hs.301497 arginyltransferase 1 


0.9 


4.8 


133913 AU076964 


Hs.7753 calumenln 


2.8 


10.5 


133914 AI458213 


Hs.77542 ESTs 


1.8 


5.6 


133917 AL031177 


Hs.7756 prateasome (prosome, macropain) 26S subu 1.5 


6.6 


133947 L41066 


Hs.77810 nuclear factor of activated T-cells, cyt 


1.5 


3.8 


133986 M5496B 


Hs.1 84050 v-Kkas2 Kirsten rat sarcoma 2 viral on 


0.9 


4.3 


133987 L15409 


Hs. 174007 von Hippel-Undau syndrome 


2.3 


4.3 


133989 AL040328 


Hs.78202 SW1/SNF related, matrix associated, adt 


3.3 


3.4 


133990 R48316 


Hs.7822 Homo sapiens mRNA; cDNA DKFZp564C1216 (f 


1.3 


134029 BE150882 


Hs.1 43601 hypothetical protein hCLA-iso 


1 


6.5 


134040 NM_003470 


Hs.78683 ubiquitin specific protease 7 (herpes vi 


1.7 


3.6 


134042 AI027881 


Hs.7869 lysosomal 


1 


7.5 


134049 AF1 17236 


Hs.78625 matrin3 


1.2 


4 


134095 NM.004354 


Hs.79069 cydinG2 


2.7 


4.8 


134098 BE513171 


Hs.79086 mitochondrial ribosoma! protein L3 


3.3 


2.1 


134207 243039 


Hs.1 70198 KIAA0009 gene product 


1.3 


3.5 


134210 AF035606 


Hs.80019 programmed cell death 6 


1.7 


6.9 


134218 U77735 


Hs.80205 pim-2 oncogene 


0.8 


5.3 


134270 X68194 


Hs.80919 synaptophysin-fike protein 


1.4 


11.4 



4.8 



5.6 



5.7 
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134277 NWL004369 Hs.8Q988 collagen, type VI, alpha 3 2.6 

134280 NMJ30O712 Hs.81029 bOrverdin reductase A 1.8 

134288 AI022650 Hs.8117 eibb2-lnteracting prateb ERBIN 1.1 

134296 R00603 Hs.8128 phosphatidytserine decarboxylase 1.1 

134300 NMJJQ1430 Hs.8136 endomefial PAS domain protein 1 0.5 

134310 AL037800 Hs.8148 selenoprotein T 1.7 

134343 D50683 Hs.82028 trareforrrang growth factor, beta recepto 0.8 

134364 X76534 Hs.82226 glycoprotein (transmembrane) nmb 2.2 

134374 N22687 Hs.8236 ESTs 1.9 

134378 AL035786 Hs.82425 actin related protein 2/3 complex, subun 1.5 

134382 BE512856 Hs.109051 SH3 domain binding glutamic acid-rich pr 1.1 

134415 AI750762 Hs.82911 protein tyrosine phosphatase type iVA, m 1.9 

134417 NM.006416 Hs.82921 solute earner family 35 (CMP-siaSc aci 1.2 

134421 AU077196 Hs.82985 collagen, type V. alpha 2 6.6 

134439 Z23024 Hs. 1 38860 RhoGTPase activating protein 1 2 

134454 NMJJ13230 Hs.286124 CD24 antigen (small cell rung carcinoma 3,5 

134494 D86981 Hs.84084 amyloid beta precursor protein (cytoplas 1.5 

134501 W84869 Hs.211568 eukaryotic translation foirjarjon factor 1,2 

134505 AW960673 Hs.177530 ATP synthase, H+ transporting, mitochond 1.3 

134520 BE091005 Hs.74861 activated RNA polymerase II transcriptio 1.8 

134528 M23161 Hs.84775 Human transposon-Qke element mRNA 0.8 

134545 AI902899 Hs.85155 butyrate response factor 1 (EGF-response 1.4 

134553 AI203545 Hs.296169 S-phase response (cycfirwelated) 0.8 

134573 NfvL016142 Hs.279617 steroid dehydrogenase homolog 1.3 

134576 AB033017 Hs.8594 KIAA1 191 protein 0.9 

134577 8E244323 Hs.85951 exporfin, tRMA (nuclear export receptor 4 
134579 AW936928 Hs.85963 OKFZP564M 182 protein 2.2 
134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 1.6 
134600 AF078859 Hs.86347 hypotMcaJ protein Z1 
134655 AF265208 Hs.123090 SWI/SNF related, matrix associated, acfi 1.7 
134700 AKD00606 Hs.8868 golgiSNAP receptor complex member 1 4.4 
134737 D17530 Hs.89434 drebrin 1 3.1 
134762 T51986 Hs.283108 hemoglobin, gamma G 0.5 
134843 AA428520 Hs.90061 progesterone binding protein 1.3 
134854 J03464 Hs.1 79573 collagen, type I, alpha 2 8.7 
134865 AA587775 Hs.66295 mulrj-PDZ-domahcontaining protein 1.7 
134868 AB020689 Hs.90419 WAA0882 protein 3.4 
134874 A1803761 Hs.90458 serine palmitoyltransferase, long chain 1.3 
134885 AJ002030 Hs.9071 progesterone membrane binding protein 1.4 
134891 R51083 Hs.90787 ESTs 1 
134908 BE089782 Hs.9877 hypothetical protein 1.9 
134934 AF005043 Hs.91390 poly (ADP-ribose) glycohydralase 1 
134970 BE560779 Hs.284233 NICE-5 protein 1.4 
134982 AK002085 Hs.92308 Homo sapiens cDNA RJ1 1223 fis, done PL 1.6 
135011 AB037835 Hs.92991 KIAA1414 protein 1.2 
135032 AW301984 Hs.173685 hypothetical protein FU1 261 9 1.7 
135035 AL034344 Hs.284186 forkhead box CI 3.2 
135051 AI272141 Hs.83484 SRY (sex determinhg region Y)-box 4 4.2 
135060 AK001887 Hs.259842 protein kinase, AMP-activated, gamma 2 n U 
135062 AK000967 Hs.93872 K1AA1682 protein 2 
135077 AW503733 Hs.9414 K1AA1488 protein 2.8 
135082 AB017363 Hs.94234 frizzled (Drosophila) homolog 1 2.4 
135107 T97257 Hs.337531 ESTs, Moderately similar to 138022 hypot 1.4 
135143 AA132813 Hs.69559 KIAA1096 protein 1.8 
135156 BE563088 Hs.9552 binder of Ari Two * 1.2 
135172 AB028956 Hs.12144 WAA1033 protein 3.1 
135181 BE250865 Hs.279529 px19-tike protein 1.3 
135222 AA534009 Hs.183487 interferon stimulated gene (20kD) 1.3 
135232 AU038812 Hs.96800 ESTs, Moderately similar to ALIOUMAN A 2.1 

135289 AW372569 Hs.9788 hypothetical protein MGC1 0924 similar to 0.9 

135290 AA331901 Hs.184736 hypothetical protein RJ10097 1 

135291 T83882 Hs.97927 ESTs \2 
135349 AA1 14212 Hs.9930 serine (or cysteine) proteinase Inhibito Z6 
135357 A1565004 Hs.79572 cathepsln O (lysosomal aspartyl protease 2.5 

135398 M16029 Hs.267270 ret proto-oncogene (mulfiple endocrine n 0.4 

135399 W79431 Hs.326249 ribosomal protein L22 1.5 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 3.2 
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30 



35 



40 



45 



50 



55 



60 



65 



302665 
302892 
302963 
303131 
303150 
310125 
312662 
319429 
320591 
406779 
410691 
410763 
415738 
420186 
422055 
425815 
426218 
427397 
427466 
427505 
427723 



430219 
430450 
432866 
433423 
437562 
437667 
437754 
440252 
441471 
448292 
449404 
449964 
451389 
452685 



R99693 
AW176909 
AW673106 
AW081061 
AA887146 
AA147979 
AA233808 
BE616412 
AA054761 
AA412048 
AW239226 
AF279145 
BE539367 
NMJJ15925 
NMJJ14320 
R94023 
AF119043 
AI929685 
AA523543 
AA361562 
A1355260 
AW601325 
X99209 
R23553 
BE395875 
BE407127 
AB001636 
BE616412 
R60366 
BE513940 
AL042986 
BE281316 
H51066 
AW001741 
N73222 
A1634651 
RCH15S47_s 
RCJV84712 
X14008_ma1J 
RC_H86543_f 
H07011 

RCJW164586.S 

RCJW570485 

RC_H98714 s 

RC_M406145J 

AA458584 

AA031548 

X02761 

RC.jAA487193 

R25326 

RC_AA393805 

RC AA449333 

RCJ^A287681_s 

RCJM90864 

RC.C14243J 

R21443 

RCJVA251902 

M21121_s 

C00038_s 

Y00503 

RC R27006J 

RCJ\A416886 

RC_AA460450 

RC_jAA488433 

RC_AA278400J 

U28831 

RC.AA199588 



Hs.224410 
Hs.42346 
Hs.151945 
Hs.103180 
Hs.8217 
Hs.285005 
Hs.286241 
Hs.286218 
Hs.169149 
Hs.279574 
Hs.65450 
Hs.8966 
Hs.295953 
Hs.95697 
Hs.1 11029 
Hs.337531 
Hs.168005 
Hs.177656 
Hs7678 
Hs.178761 
Hs.279789 
Hs.324278 
Hs.235887 
Hs.241489 
Hs.279609 
Hs.8997 
Hs.5683 
HS.286218 
Hs.5822 
Hs.6101 
Hs.7857 
Hs.47334 
Hs.23581 
Hs.273193 
Hs.279009 
Hs.30250 



Homo sapiens cDNA FU12843 fis, clone NT 3.6 

caldneurin-bindkig protein calsarcin-1 3.3 

mitochondrial ribosomal protein L43 0.9 

DC2 protein 3 

stroma) antigen 2 6.2 

mitochondrial import receptor Tom22 1 .2 

protein kinase, cAMP-dependent, regulato 1 

junctional adhesion molecule 1 1 .5 

karyopherin alpha 1 (importin alpha 5) 1 .2 

CG1-39 protein; cell death-regulatory pr 1 .3 

reticulon4 1.2 

hypothetical protein RJ21 776 2 

ESTs,Weakty similar to AF220049 1 uncha 1.3 

Over-specific bHLH-Ztp transcription fa 1 .5 

putative heme-binding protein 2 

ESTs. Moderately similar to 138022 hypot 1 .7 

Homo sapiens cONA FU13372 fis, clone PL 3.3 

calmodulin 1 (phosphorylase kinase, delt 1 .3 

cellular retinoic acid-binding protein 1 11 

26S proteasome-assocJated pad1 homolog 3.2 

hlstone deacetylase 3 2.8 
Homo sapiens mRNA; cONA DKFZp566M063 (fr 

HMT1 (hnRNP methyltransferase, S. cerevi 1 .8 
hypothetical protein 
mitochondrial carrier homolog 2 
heat shock 70kO protein 1 A 
DEAD/H (Asp-Glu-Ala-Asp/His) box porypep 
junctional adhesion molecule 1 
Homo sapiens cONA: FU22120 fis, clone H 
hypothetical protein MGC3178 
erythrocyte membrane protein band 
hypothetical protein FU14495 



1.1 
1.5 
1.3 
1.6 
1.3 
2 

1.1 
0.5 
2.5 
1.1 
1.4 
4 

0.8 
1.8 
3.5 
0.9 
1.8 
1.8 
ESTs 
3.4 
1.6 
ESTs 
3.4 



hypothetical protein FU10706 
matrix Gla protein 

v-maf musculoaponeurotic fibrosarcoma (a 
peptidytproiyl Isomerase B (cydophQinB) 
calumenin 

iysozyme (renal amyloidosis) 
ESTs 

ESTs; Weakly similar to SAS [H.sapiens] 

Homo sapiens clone 23967 
ESTs 

SRY (sex determining region Y>box 4 
ceil division cycle 42 (GTP-blnding protein; 25kD) 

fibronectfn 1 3.6 

secreted frizzled-related protein 4 4.7 

Homo sapiens mRNA for putative vacuolar 0.9 

ESTs; Weakly similar to (deffine not 1.1 

ESTs 2.9 

ESTs 

ESTs; Highly similar to heat shock factor 1 .4 

ESTs; Highly similar to heat shock factor 1 .7 

ESTs 1.6 

Homo sapiens lysophospholipase (LPL1 ) 2.2 

small inducible cytokine A5 (RANTES) 0.9 

ESTs 2.8 

keratin 19 3.1 

ESTs 1.6 

ESTs; Weakly similar to predicted using 3.1 

fibroblast growth factor receptor 2 (bacteria- 1 .5 



3.6 

I. 6 
4.2 
17.3 
4 

6.6 
3.5 
4.7 
,5.6 
3.5 
13.9 
5.1 
3.9 
6.2 

II. 3 
3.6 
2.8 
4.7 
3.7 
2.5 
22 

I. 1 
8.8 
5.6 
6.1 
7.6 
6.5 
3.5 
5.7 
6.2 
3.7 
4.9 
3.6 
3.5 

II. 2 
5.6 
4.8 
4.6 
4.5 
6.6 
3.9 
6.2 
2.6 
3.5. 
4.6 
0.4 
3.1 
15.2 
4 

5 

8.4 

4.6 

1.3 

5 

5 

37 
3.8 
9.9 
4.8 
1.1 
3.7 
3.1 
3.7 
4 



5.2 



0.8 



3.9 



ESTs; Weakly similar to deduced amino acid 1 .1 

Homo sapiens HRIHFB21 1 5 mRNA; partial cds 
Human protein immuno-reactive with anti-PTH4.4 0.6 
Homo sapiens actin-related protein Arp3 (ARP3) 1 .8 4.7 



1.5 



3.6 
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AFO06O82 
RC_H90899 
RCJV95070 
RC T90946 f 
087258 
AA313414 s 
RCJf73484_s 

AFFX-HUMISGF3A/M97935 3 
AFFX-HUMRGE/M1QG98 5 
AFFX-M2783Q.5 
AFFX-M27830_5 
RC.AAQ6343U 

RC J63769J ferritin; light polypeptide 



Homo sapiens acfowelatad protein Arp2 (ARP2) 
desmoplatdn (DPI; DPII) 5.4 
desmoptakin (DPI; DPII) 5 
Human mRNA for KIAA263 gene; complete cds 
protease; serine; 1 1 (IGF binding) 2.4 
ESTs; Weakly similar to cDNA EST EMBLT1 157 
ESTs; Weakty simiiarto similar to Yeast 1.3 



1.1 



ESTs 
1.1 



1.6 
5.5 
2.6 
1.1 
3.5 
1.5 
6.3 
2.3 
7.9 
0,5 
0.6 
0.8 
3.7 



10.9 

3.9 

5.3 

13.5 

7.4 
5.4 
4.1 
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TABLE 8A 

Table 8A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



108469 1 16761J AA079487 AA128547 AA128291 AA079587 AA079600 
20 125076 190299.1 AA973971 T88817 AA253263 

114636 109698J AA075488 AA129081 AA074851 AA082852 AA074732AA084908AA084751 AA076042 AA131172 AA085374 AA079519 
AA074510 M1 13824 AA1 02437 AA070833 AA070143 AA084693 AA084389 AA076373 AA075492 AA062834 AA084335 
M078829 M079344 M069916 AA079275 AA070914 AA654069 AA081976 AA080957 AA0831 15 AA070942 AA085296 
123528 genbanK^A608657 AA608657 
25 123533 genbank_AA608751 AA608751 
125090 genbank_T91518 T91518 
125154 genbank_W38419 W38419 
118475 genbank^N66845 N66845 
118505 genbank_N67343 N67343 
30 101046 entrezJ<01160K01160 

129982 221J67 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X6101 1 M34024 AA327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
3 5 AF062177 AF062222 AF0621 15 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01 276 AF062168 AF062136 X81755 X81748 AF17401 9 AF062285 L06924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 133035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
40 X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 

083676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560AF1 74060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
45 Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 

AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 UO0509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 UM343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 
50 AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 

AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW4067O2 
AW351514 D78345 T29140 J00231 NM_002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA71 5266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T98113 
55 108470 genbank u AA079500 AA079500 
101447 entrez_M21305 M21305 
124447 genbank_N48000 N48000 
101624 entrez_M55998 M55998 

131791 221.260 X621 11 S67984 AJ131056 Z47243 Z47235 AF062268 Z47237 AJ131058 AF062130 AF062282 X62108 AA385989 AA464794 
60 X69861 AW402964 M90808 Z98735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF 174071 U0G552 U96288 

AF021989 AF062255 AF174061 S73953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 Z98714 Z98738 
S75886 AJ008175 AJ008186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
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H45128 R86072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW354072 H45561 H45556 H42605 A1735017 T47421 R48719 H27570 
H44599 AI459598 H42347 H41938 H24993 AA345888 H22339 AJ538691 AJ012264 AA684201 AI880450 AA327310 AI991250 
AI833028 AW001210 AI956075 H30467 AA326915 H41943 AI749266 A1744441 AA327377 AW512326 A1735170 H01634 
AI587047 AI571623 AA327486 AA327103 AA327195 AA326973 T28143 



130064 221.264 X57815 129157 AA367448 AW32B098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 



X57819 X57823 AW405604 AW404447 Z34914 AW406542 AA427726 M604389 AW405606 AW405918 AW4051 17 
AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW375738 X87888 AF124166 AF054638 AJ241406 
U09902 AW405284 M12025 AA360219 L03631 L03629 AW327727 AF194810 AF058076 AF194686 AF063774 AF063737 
AF063755 AF063740 AF063739 AF063738 S73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF19481 1 
AF194809 AF194814 AF194808 AF064505 AF0645O3 AF063765 AF063757 AF063752 AF063716 AF063736 AF194806 
AF058077 AF063747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 
U38589 Z18332 AF060122 AF194807 AF060135 AF064506 AF064504 AF063773 L26540 AF058072 AF06013 1 AF064500 
AF060136 AF064513 AF194683 AF194711 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916 AJ241405 AF194632 AJ388659 AF194625 Z74694 AF194588 AF1 94601 U09901 U0991 1 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
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TABLE 9: Figure 9 from BRCA 001-2 US 

5 Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelO: Unigene number 
Unigene Title: Unigene gene title 

15 Pkey ExAccn UnigenelD UnigeneTitle 

100690 AA383256 Hs.1657 estrogen receptor 1 

102211 BE314524 Hs.78776 putative transmembrane protein 

103587 BE270266 Hs.82128 5T4 oncofetal trophoblast glycoprotein 

20 104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal syndrome ) 

105038 AW503733 Hs.9414 KIAA 1488 protein 

105500 AW602166 Hs,222399 CEGP1 protein 

105990 AI690586 Hs.29403 hypothetical protein FU22060 

106155 AA425414 Hs.33287 nuclear factor I/8 

25 106373 AW503807 Hs.21907 histone acetyltransferase 

106414 BE568205 Hs.28827 miiogen-actlvated protein kinase kinase kinase 2 

110009 BE075297 Hs.6614 ESTs, Weakly similar to A4 3932 mucin 2 precursor, Intestinal . 

111900 AF131784 Hs.25318 Homo sapiens clone 25194 mRNA sequence 

114540 AI904232 Hs.75323 prohibitin 

30 116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 

1 17280 M18217 Hs.172129 Homo sapiens cDNA: FU21409 fis. clone COL03924 

119771 AI905687 Hs.2533 EST 

121723 AA243499 Hs.104800 hypothetical protein FU 10134 

124059 BE387335 Hs,283713 ESTs, Weakly similar to S64054 hypothetical protein YGL050w 

35 1 31 148 AW953575 Hs.303125 p53-induced protein PIGPC1 

132371 AA235448 Hs,46677 PRO2000 protein 

134169 AI690916 Hs.178137 transducer of ERBB2, 1 

302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F1 12 

452410 AL133619 Hs.29383 Homo sapiens mRNA; cDNA DKFZp434 
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TABLE 10: Figure 10 from BRCA 001-3 PCT 

5 Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 
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R1: Ratio of tumor to normal body tissue 

R2: Ratio of SO" percentile tumor to body 

R3: Ratio of 75* percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 
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134089 R51273 Hs.79029 
15 134095 NM.004354HS.79069 
134098 BE513171 Hs.79086 
134110 U41060 Hs.79136 
134125 NM_014781Hs.50421 
134246 D28459 Hs.80612 

20 134257 C05768 Hs.8078 
134272 X76040 Hs.278614 
134282 R45621 Hs.81057 
134288 A1022650 Hs.8117 
134321 BE538082 Hs.8172 

25 134326 AW903838 Hs.81800 
134328 AW959281 Hs.8184 
134348 AW291946 Hs.82065 
134359 NM_001982Hs.199067 
134367 AA339449 Hs.82285 

30 134374 N22687 Hs.8236 
134380 AU077143 Hs.179565 
134395 AA456539 Hs.8262 
134401 AI916662 Hs.211577 
134405 AW067903 Hs.82772 

35 134415 A1750762 Hs.82911 
134417 NH.006416HS.82921 
134419 W95642 Hs,82961 
134421 AU077196 Hs.82985 
134436 U29344 Hs.83190 

40 134485 X82153 Hs.83942 
134487 AF061739 Hs.83954 
134495 063477 Hs.84087 
134520 BE091005 HsJ4861 
134542 M14156 Hs.85112 

45 134570 U66615 Hs.172280 
134590 AW903849 Hs.173840 
134604 NM_002884Hs.865 
134612 AW068223 Hs.171581 
134643 AW299723 Hs.87223 

50 134654 AK001741 Hs.8739 
134656 AI750878 Hs.87409 
134672 AF271212 Hs.322901 
134700 AK000606 Hs.8868 
134711 X04011 Hs.88974 

55 134722 AF129536 Hs.284226 
134856 BE281128 Hs.9030 
134880 AI879195 Hs.90606 
134917 X87241 Hs.166994 
134921 AL137491 Hs.125511 

60 134982 AK002085 Hs.92308 
134989 AW968058 Hs.92381 
135029 H58818 Hs.187579 
135035 AL034344 Hs.284185 
135051 AI272141 Hs.83484 

65 135062 AK000967 Hs.93872 
135073 W55956 Hs.94030 
135098 AW274526 Hs.277721 



cadherin 11. type 2, OB-cadherin (osteob 


3.2 


560 


174 


2.6 


deconn 


5.4 


144 


27 


13.3 


arfaptin 1 


4.7 


47 


1 


4.1 


myxovlrus {influenza) resistance 1, homo 


3.3 


380 


114 


4.9 


Homo sapiens mRNA; cDNA DKFZp564B1264 (f 6.7 


304 


46 


7.8 


ESTs 


6.2 


600 


97 


4.1 


catumenin 


3.3 


889 


267 


5 


transcription factors (represses interi 


3.7 


91 


25 


2.6 


Homo sapiens mRNA; cDNA DKFZp564C1216 (f 3.4 


91 


27 


8.5 


RAB2, member RAS oncogene family 


7.8 


78 


1 


5.6 


serine (or cysteine) proteinase inhibito 


5.9 


59 


1 


3.3 


K1AA0244 protein 


5.8 


58 


1 


4.9 


Ihymbe-ONA gfycosylase 


6.4 


100 


16 


4.4 


ESTs 


5.1 


51 


9 


3.8 


cyclin G2 


5 


50 


1 


3.2 


mitochondria! ribosomai protein L3 


4.6 


246 


51 


3.9 


UV-1 protein, estrogen regulated 


4.5 


1472 


330 


2.1 


K1AA0203 gene product 


4.6 


69 


15 


5.8 


ubiquitirwonjugaling enzyme E2A (RAD6 h 


7 


97 


14 


7.5 


Homo sapiens done FBD3 Cri-du-chat crit 


3.4 


34 


5 


2.6 


protease, serine, 15 


3.6 


36 


1 


2.8 


hypothetical protein MGC2718 


6.7 


67 


9 


5.7 


erbb2-interacting protein ERBiN 


4.5 


137 


31 


12 


ESTs, Moderately similar to A46010 X4in 


5.2 


52 


1 


4.9 


chondroiSn sulfate proteoglycan 2 (vers 


8.6 


568 


66 


22.4 


ESTs 


4.8 


53 


11 


3.7 


interieukin 6 signal transducer (gp130, 


7.1 


71 


4 


6.4 


v-erb-b2 avian erythroblastic leukemia v 


3 


68 


23 


2.8 


pbosphoribosylgr/cmamide formyitransfer 


4.4 


44 


1 


4.1 


ESTs 


13.3 


445 


34 


6 


minlchromosome maintenance deficient (S. 


4.5 


45 


2 


3.4 


lysosomal 


6 


60 


5 


5.9 


kinectin 1 (kinesin receptor) 


4.1 


301 


73 


6.1 


collagen, type XI, alpha 1 


4.6 


1216 


267 


4.4 


protein tyrosine phosphatase type IVA, m 


4.9 


163 


34 


15.1 


solute earner family 35 (CMP-slalic aci 


4.9 


49 


3 


3.8 


trefoil factor 3 (intestinal) 


3.2 


1872 


592 


3.3 


collagen, type V, alpha 2 


6.3 


1075 


171 


3.8 


fatty acid synthase 


3.3 


710 


217 


2 


cathepsin K (pyenodysostosis) 


34.3 


411 


12 


5.1 


protein associated with PRK1 


4.8 


153 


32 


4.3 


KIAA0143 protein 


3.1 


147 


48 


12.7 


activated RNA polymerase ti transcriptio 


3.3 


33 


1 


2 


insulin-like growth factor 1 (somatomedi 


4.2 


42 


5 


2.6 


SWI/SNF related, matrix associated, acti 


3.9 


39 


1 


2.5 


HUEl (C4orf1)-interacting protein 


3.7 


41 


11 


0.6 


RAP1A, member of RAS oncogene family 


5.2 


52 


1 


3 


ubiquitin Oterminal hydrolase UCH37 


4.9 


49 


1 


3.7 


bone morphogenetic protein receptor, typ 


5.2 


52 


5 


3.5 


hypothetical protein FU10879 


6.4 


64 


1 


5.1 


thrombospondki 1 


12.6 


126 


1 


10.8 


disrupter of silencing 10 


5.4 


81 


15 


2.6 


golgl SNAP receptor complex member 1 


3.4 


179 


52 


1.5 


cytochrome b-245, beta polypeptide (chro 


3.2 


143 


45 


13.9 


F-boxonly protein 6 


7 


70 


6 


6 


TONDU 


3.1 


31 


1 


2.3 


15 kDa selenoprotein 


5.7 


57 


1 


5 


FAT tumor suppressor (DrosophEa) homolo 


3.2 


153 


48 


4.7 


Homo sapiens mRNA; cDNA DKFZp434P1530 (f 4 


452 


114 


2 


Homo sapiens cDNA FU1 1223 fis, clone PL 


5.1 


150 


30 


72 


nudix (nucleoside diphosphate linked moi 


8.2 


114 


14 


9.9 


hydroxysteroid (17-beta) dehydrogenase 7 


11.5 


115 


1 


10 


forkhead box C1 


5.4 


259 


48 


1.4 


SRY (sex determining region Y)-box 4 


3.3 


1296 


394 


2.2 


K1AA1682 protein 


3.8 


240 


64 


3.2 


Homo sapiens mRNA; cDNA DKF2p586E1624 (f 8.1 


101 


13 


7.9 


ovanan carcinoma antigen CA125 


3.3 


33 


1 


2.6 
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'1&144 
135154 
135155 
135172 
135242 
135243 
135269 
135356 
135357 
135389 



V&2493 Hs.94694 
NM_016255Hs.95260 
AK001835 Hs.267812 
AI207958 Hs.166556 
AB028956 Hs. 12144 
AI583187 Hs.9700 
BE463721 Hs.97101 
NrVL003403Hs.97496 
BE312948 Hs.18104 
AI565004 Hs79572 



435400 



U05237 

L14922 

X78592 

AI471525 

X70683 

L14922 

M23253 

AI267886 



Hs.99872 

Hs.166563 

Hs.99915 

Hs.247486 

Hs.93668 

Hs.82128 

Hs.904 

Hs.148027 



AA044840 Hs.241676 

N90960 Hs.227459 

AA873285 Hs.137947 

T56679 Hs.865 
AA305536 
AI369384 

AA219081 Hs.242396 



Homo sapiens cDNA RJ10561 fis, clone NT 


5.3 




1 
i 
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Autosomal Highly Conserved Protein 


7.4 


74 


5 
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sorting nexin 4 


6.6 
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oy 


11 
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Homo sapiens, Similar to TEA domain fami 
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R1 
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1 


1 




KJAA1033 protein 


3.4 


oo 


OR 
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cycfinEI 


3.1 


^1 
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] 






putative G protein-coupled receptor 


3.4 


169 


50 


9.1 




YY1 transcription factor 


3.4 


475 


142 


2.5 




hypothetical protein FU11274 


3.1 


31 


10 


1.7 




calhepsin D (lysosomal aspartyl protease 


4.7 


710 


151 


2.5 




fetal Alzheimer antigen 


20.6 


206 


4 


19.1 




replication factor C (activator 1) 1 (14 


3.2 


32 


1 


2.4 




androgen receptor (dihydrotestosterone r 


3.2 


117 


37 


9.4 




ESTs 


3.8 


58 


16 


5.5 




ESTs 


1.8 


1047 


596 


1.6 




5T4 oncofetal trophoblast glycoprotein 


5 


285 


58 


1.2 




amylo-1;6-glucosidase; 4-aIpha-glucanotransferas 


3.1 


31 


1 


2.6 


polymerase (RNA) II (DNA directed) polypeptide B 


7.8 


137 


18 


11.9 


stromal cell-derived factor 1 


4.7 


114 


25 


0.9 




ESTs; Moderately similar to !!!! ALU SUBFAMILY 


4.7 


151 


32 


9.3 


ESTs 


4.7 


47 


3 


4.4 




RAP1A; member of RAS oncogene family 


4 


40 


1 


3.4 




"EST1 76522 Colon carcinoma (Caco-2) cell line II 


3.6 


121 


34 


11.8 


arylsulfatase D 


3.5 


113 


33 


1.7 




ESTs; Moderately similar to lit! ALU SUBFAMILY] 


3.4 


107 


32 
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TABLE 10A 

Table 10 A shows the accession numbers for those pkeys lacking unigenelD's for Table 10. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 


CAT number 


Accession 


123619 


371681J 


AA602964AA609200 


104602 


524482 2 


H47610R86920 


121581 


283769J 


M416568 AA442889 M417233 AA442223 


123523 


genbank_AA508588 AA6Q8588 


100821 


tigr_HT4306 


M26460 U09116 


125091 


genbankj"91518 


T91518 


125150 


NOLFOUND.entrez W38240 W38240 


118475 


genbank L _N66845 


N66845 


104787 


genbankJ\A027317 AA027317 


106055 


genbanLM417034 AA417034 


113702 


genbanK_T97307 


T97307 


101046 


entrezj<01160 


KO1160 


101447 


entre2j/121305 


M21305 


101624 


entre^_M55998 


M55998 


124677 


genbankJW1073 


R01073 


110581 


genbanK_H61560 


H61560 


119023 


genbank_N98488 


N98488 


110775 


genbank_N22414 


N22414 


112092 


genbank w R44538 


R44538 


112253 


genbank_R51818 


R51818 


107014 


genbankJ\A598820 AA598820 


114988 


genbank_M251Q89 AA251089 
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TABLE 11: Figure 11 from BRCA 001-3 PCT 

5 Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset Identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 

R2: Ratio of 90 th percentile tumor to normal body 

15 . R3: Ratio of 75 th percentile normal body to tumor 

R4: Ratio of tumor to normal breast tissue 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn UnigenelD Unigene Title 


R1 


R2 


R3 


R4 


100131 D12485 Hs.11951 edonucleotidepyrophosphatase/phosphodi 


13.2 


244 


19 


9.9 


100147 D13666 Hs.136348 osteoblast specific factor 2 (fascidin 


15.7 


1030 66 


5 


100522 X51501 Hs.99949 proladlnnnduced protein 


22.7 


760 


34 


1.4 


100666 L05424 Hs.169610 C044 antigen (homing function and Indian 


8.5 


85 


1 


3.2 


101104 AW862258 Hs.169266 neuropeptide V receptor Y1 


15.3 


153 


1 


14.1 


101478 NMJ)0289QHs.758 RAS p21 protein activator (GTPase activa 


9.6 


96 


1 


8.5 


101724 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 


9.4 


94 


1 


0.3 


101754 S70114 Hs.239489 TIA1 cytotoxic granule-associated RNA-bi 


8.9 


89 


5 


8 


101888 AL049610 Hs.95243 transcription elongation factor A (Sll)- 


7.3 


73 


1 


5.3 


102165 BE313280 Hs.159627 death associated protein 3 


9.3 


93 


5 


8 


102304 AF015224 Hs.46452 mammaglobin 1 


8.5 


2058 243 


1.4 


102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 


6.4 


428 


67 


2.3 


102457 NM_001394Hs.2359 dual speaTdty phosphatase 4 


20.2 


202 


5 


1.3 


102567 U63830 Hs.146847 TRAF family member-associated NFKB activ 


8.2 


82 


1 


6.8 


102823 085390 Hs.5057 carboxypeptidaseD 


5.6 


56 


1 


5.3 


103557 AL133415 Hs.297753 vimentin 


7.5 


136 


18 


3.4 


103613 Nty_000346Hs.2316 SRY (sex determining region YHjox 9 (ca 


7.3 


73 


1 


5.2 


104115 AF183810 Hs.26102 opposite strand to trlchorhinophalangeal 


29 


290 


1 


26.8 


104667 AJ239923 Hs.30098 ESTs 


14.9 


149 


1 


6.4 


104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H.sapien 


7.7 


77 


1 


5.1 


104807 A1139058 Hs.125790 leucine-rich repeat-containing 2 


7 


70 


1 


6.5 


104896 AW015318 Hs.23165 ESTs 


7.4 


74 


1 


6 


104943 AF072873 Hs.1 14218 frizzled (Drosophila) homolog 6 


16.2 


162 


1 


42 


105038 AW503733 Hs.9414 K1M1 488 protein 


5.5 


55 


1 


5.2 


105329 AA234561 Hs.22862 ESTs 


2.8 


131 


47 


3.9 


105500 AW602166 Hs.222399 CEGP1 protein 


25.4 


508 


20 


3 


105516 AK001269 Hs.30738 hypothetical protein FU 10407 


8.3 


83 


3 


1.8 


105730 AW377314 Hs.5364 DKFZP564I052 protein 


6.9 


69 


1 


4.4 


106012 AI240665 Hs.8895 ESTs 


21.2 


212 


6 


17.4 


106095 AF115402 Hs.11713 E74^ke factor 5 (ets domain transcript 


26.3 


356 


14 


1 


106155 AA425414 Hs.33287 mdear factor l/B 


9.9 


483 


49 


1.8 


107102 AB037765 Hs.30652 KIAA1 344 protein 


6.3 


63 


1 


5.4 


107136 AV661958 Hs.8207 GK001 protein 


2.5 


392 


155 


4.3 


107151 AW378065 Hs.8687 ESTs 


15.6 


156 


7 


10.8 


107922 BE153855 Hs.61460 ig superfamiiy receptor LNiR 


9 


90 


1 


5.5 


108339 AW151340-HS.51615 ESTs. WeaWy similar to ALU7JWMAN ALUS 


18.7 


187 


1 


17 


109112 AW419196 Hs.257924 hypothetical protein FU 13782 


4.1 


334 


82 


3.4 


109292 AW975746 Hs.188662 K1AA1 702 protein 


7.1 


71 


1 


6.5 


109415 U80736 Hs.110826 trinucleotide repeat containing 9 


12.3 


123 


1 


11.3 


109912 AW390822 Hs.301528 L-kynurenine/alpha^noadipateaminotra 


14.2 


142 


1 


9.5 


110009 BE075297 Hs.6614 ESTs,WeakrysimaartoA43932mudn2p 


6.3 


693 


110 


7.2 


110915 BE092285 Hs.29724 hypothetical protein FU13187 


20.9 


209 


1 


19.5 


111164 N46180 Hs.122489 Homo sapiens cONA FU 13289 fis, clone OV 


7.7 


77 


1 


5 
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111179 AK000136 Hs.10760 asporin (LRR class 1) 


25.1 


288 


12 


6.7 


111190 AK002055 Hs.151046 hypothetical protein FU1 1193 


6.3 


63 


1 


5.8 


111223 AA852773 Hs.334838 KIAA1866 protein 


3.6 


402 


112 


4.9 


111357 BE314949 Hs.87128 hypoffietical protein RJ23309 


3.8 


425 


111 


4 


112244 AB029000 Hs.70823 K1AA1 077 protein 


57 


567 


100 


67 


113047 AI571940 Hs.7549 ESTs 


9.6 


124 


13 


9 


113702 T97307 gb:ye53h05.s1 Soares fetal fiver spleen 


12.3 


129 


11 


11.7 


114124 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 


24.2 


242 


10 


5.6 


114138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fr 6.7 


67 


1 


6.3 


114768 AF212848 Hs.182339 ets homologous factor 


137 


137 


1 


8.9 


114860 AL157545 Hs.42179 faromodomaln and PHD finger containing, 3 


9.1 


91 


1 


7.6 


114965 AI733881 Hs.72472 BMP-R1B 


35.9 


359 


10 


29.7 


1 14988 AA251089 gb:zs04f05.s1 NCI_CGAPJ3CB1 Homo sapiens 1 1.5 


115 


1 


6.9 


115206 AW183695 Hs.186572 ESTs 


5.8 


58 


1 


5 


115719 AW992405 Hs.59622 Homo sapiens, clone IMAGE:3507281, mRNA. 


7.6 


144 


19 


13.9 


115844 AI373062 Hs.332938 hypothetical protein MGC5370 


6.2 • 


62 


1 


5.4 


116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 


1.8 


1047 596 


1.6 


1 16786 H25836 Hs.301527 ESTs, Moderately similar to unknown \Hs 


22.8 


228 


9 


12.4 


1 17280 M18217 Hs.172129 Homo sapiens cDNA: FU21409 fis, clone C 


3.9 


322 


83 


4.4 


117412 N32536 Hs.42645 solute carrier family 16 (monocarboxyiic 


17.4 


174 


9 


6.9 


118472 AL1 57545 Hs.42179 bromodomain and PHD finger containing, 3 


14.5 


145 


1 


2.4 


119271 A1061118 Hs.65328 Fanconi anemia, complementation group F 


8.2 


82 


1 


6.4 


119771 AI905687 Hs.2533 EST 


3.5 


2073 


I 595 


21 


120562 BE244580 Hs.302267 hypothetical protein FIJI 0330 


8.5 


127 


15 


1.6 


121463 AK000282 Hs.239681 hypothetical protein FU20275 


10.3 


103 


1 


9.3 


121723 AA243499 Hs.104800 hypothetical protein FU10134 


2.9 


214 


74 


3.7 


122963 AA478446 Hs.69559 WAA 1096 protein 


7.2 


72 


1 


5.7 


123137 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 


9,9 


351 


36 


13.9 


123619 AA602964 gb:no97c02.s1 NCLCGAP Pr2 Homo sapiens 


8.5 


85 


1 


4.3 


123709 AA706910 Hs.1 12742 ESTs 


3.9 


60 


16 


4.8 


124006 AI147155 Hs.270016 ESTs 


5.8 


321 


55 


17 


124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 


10.4 


880 


85 


5.3 


124308 AA249027 Hs.241507 ribosomal protein S6 


10.5 


105 


1 


9.9 


125279 AW401809 Hs.4779 KIAA1 150 protein 


13.1 


131 


1 


5.1 


125517 AA287921 Hs.164950 ESTs 


6.7 


67 


1 


6 


127439 D60237 Hs.14368 SH3 domain binding glutamic acid-rich pr 


30.6 


306 


4 


26.5 


128305 A1954968 Hs.279009 matrix Gla protein 


7.5 


75 


1 


6.5 


128482 A1694143 Hs.296251 programmed ceB death 4 


7.2 


72 


1 


5.8 


128790 AF026692 Hs.105700 secreted frizzled-related protein 4 


17.4 


409 


24 


7.8 


128925 R67419 Hs.21851 Homo sapiens cDNA FU 12900 fis, clone NT 


7.1 


392 


56 


3.6 


129017 AA1 15333 Hs.107968 ESTs 


8.2 


82 


1 


7.4 


129229 AF013758 Hs.109643 polyadenylate binding pratekvinteractin 


7.1 


71 


1 


6.2 


129337 NMJ)14918Hs.110488 K1AA0990 protein 


9.5 


95 


1 


8.5 


129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 


7.1 


150 


21 


14.5 


129821 AB028945 Hs. 12696 cortacun SH3 domain-binding protein 


11.4 


114 


1 


10 


130036 BE061916 Hs.125849 chromosome 8 open reading frame 2 


6.7 


67 


1 


5.7 


130057 AF027153 Hs.324787 solute carrier family 5 (inositol transp 


1 


1 


1 


1 


130095 AKD01635 Hs. 14838 hypothetical protein FU10773 


14.6 


219 


15 


7.6 


130343 AB040914 Hs.278628 K1AA1481 protein 


13.2 


331 


25 


12.4 


130385 AW067800 Hs.155223 stanniocaldn 2 


72.2 


722 


1 


1.9 


130407 BE385099 Hs.334727 hypothetical protein MGC301 7 


6.5 


65 


4 


5.3 


130441 U63630 Hs.155637 protein kinase, DNA-acttvated, catalytic 


6.1 


61 


1 


5.7 


130455 090041 Hs.155956 N^cetyitrarisferase 1 (aryfamlne Nicety 


10.8 


706 


66 


9.2 


130604 AA383256 Hs.1657 estrogen receptor 1 


32.2 


322 


1 


4.7 


130617 M90516 Hs.1674 glutamine-fm<tose-6i)hosphatetransarnin 


10 


100 


1 


7.6 


130712 AJ271881 Hs.279762 bromodomah-containBig 7 


17.5 


175 


2 


12.8 


131148 AW953575 Hs.303125 p53-induced protein PIGPC1 


3.8 


585 


153 


3.7 


131388 NM.014810HS.92200 K1AA0480 gene product 


7.6 


76 


1 


5 


131564 T93500 Hs.28792 Homo sapiens cDNAFU 11 041 fis, clone PL 


4.7 


381 


81 


6.4 


131742 AA961420 Hs.31433 ESTs 


11.7 


117 


1 


10.1 


131877 J04Q88 Hs.156346 topoisomerase (DNA) II alpha (170kD) 


6.8 


68 


1 


5.6 


131985 AA503020 Hs.36563 hypothetical protein FU22418 


40.2 


402 


1 


4 


132316 U28831 Hs.44566 KIAA1 641 protein 


18.6 


186 


10 


1.5 


132528 T78736 Hs.50758 SMC4 (structural maintenance of chromoso 


9.3 


93 


1 


8.4 


132742 AA025480 Hs.292812 ESTs, Weakly similar to T33468 hypotheti 


6.5 


65 


1 


5.6 


132990 X77343 Hs.334334 transcription factor AP-2 alpha (activat 


12.7 


311 


25 


24 


133015 AJ002744 Hs.246315 UDP-N^c8ty^lpha-£VgabctosamIne.-polyp 


4.6 


427 


93 


10.4 
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133199 
133240 
133271 
133640 
5 133746 



134110 
134485 
134654 
10 1346B0 
135029 
135389 
128305 



AF231981 

AK001489 

Z48633 

AW246428 

AW410035 

AA535244 

U41060 

XB2153 

AKD01741 

AI879195 

H58818 

U05237 

AI954968 



Hs.250175 

Hs.242894 

Hs.283742 

Hs.75355 

Hs.75862 

Hs.78305 

HsJ9136 

Hs.83942 

Hs.8739 

Hs.90606 

Hs.187579 

Hs.99872 

Hs.279009 



homolog of yeast long chain polyunsatura 


3 


816 


275 


3.9 


ADP-fibosytation factor-like 1 


8.1 


81 


1 


4.6 


H.sapiens mRNA for retrotransposon 


12.4 


124 


6 


10.8 


ubiquitin-conjugaiing enzyme E2N (homo)o 


8.5 


85 


1 


72 


MAO (mothers against decapentaptegic, Or 


9.3 


93 


1 


7.8 


RAB2, member RAS oncogene family 


7.8 


78 


1 


5.6 


UV-1 protein, estrogen regulated 


4.5 


1472 330 


2.1 


caihepsin K (pycnodysostosis) 


34.3 


411 


12 


5.1 


hypothetical protein FU 10879 


6.4 


64 


1 


5.1 


15 kDa selenoprotein 


5.7 


57 


1 


5 


hydroxysteroid (17-beta) dehydrogenase 7 


115 


115 


1 


10 


fetal Alzheimer antigen 


20.6 


206 


4 


19.1 


matrix Gla protein 


9.4 


94 


3 


5.3 
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TABLE 11A 



5 

10 
15 
20 



Table 11A shows the accession numbers for those pkeys lacking unigenelD's for Table 1 L 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681 1 AA602364 AA609200 

113702 genbank T97307 T97307 

114988 genbank_AA251089 AA251089 
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TABLE 12: Figure 12 from BRCA 001-3 PCT 



Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey. Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 

R2: Ratio of 90* percenfile tumor to body 

R3: Ratio of 75 ft percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 



Pkey 


ExAccn UnigenelD Unigene Title 


R1 


R2 


R3 


R4 


100131 


D12485 Hs.11951 phosphodiesterase I (PC-1) 


13.2 


244 


19 


9.9 


105500 


AW602166 Hs.222399 ESTs 


25.4 


508 


20 


3 


112244 


ABO29O0O Hs.70823 KIAA1077 protein 


5.7 


567 


100 


6.7 


114124 


W57554 Hs.125019 ESTs 


24.2 


242 


10 


5.6 


119771 


AI905687 Hs.2533 ESTs 


3.5 


2073 


595 


2.1 


121723 


AA243499 Hs.104800 ESTs 


29 


214 


74 


3.7 


128790 


AF026692 Hs.105700 secreted frizzied-related protein 4 


17.4 


409 


24 


7.8 


131148 


AW953575 Hs.303125 ESTs 


3.8 


585 


153 


3.7 


131985 


AA503020 Hs.36563 ESTs 


40.2 


402 


1 


4 


133199 


AF231981 Hs.250175 Homo sapiens clone 23904 mRNA sequence 3 


816 


275 


3.9 
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TABLE 13: Table 1 from BRCA 001-5 US 
Table 13 depicts a preferred group of genes upregulated in breast cancer 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD: Unigene number 

Unigene Title: Unigene gene due • 



R1: 



Ratio of tumor to normai body tissue 



Pkey 


ExAccn 


UniGene ID 


Unigene Title 


R1 


100038 


M97935 




control 


16.7 


100039 


M97935 




control 


6.3 


100040 


M97935 




control 


8.3 


100041 


M97935 




control 


14.8 


100082 


AB003103 


Hs.4295 


proteasome (prosome; macropain) 26S sub 


7.5 


100091 


AF000177 


Hs.111783 


Lsm1 protein 


4.9 


100100 


AF006084 


Hs.11538 


actin related protein 2/3 complex; subunlt 


4.7 


100103 


AF007875 


Hs.5085 


dolichyl-phosphate mannosyltransferase p 


13.4 


100114 


000596 


Hs.82962 


thymidyiate synthetase 


15.9 


100121 


D10495 


Hs. 155342 


protein kinase C; delta 


4.6 


100123 


D10523 


Hs. 168669 


oxogkrtarate dehydrogenase (lipoamide) 


7.5 


100128 


D11094 


Hs.61153 


proteasome (prosome; macropain) 26S sub 


4.4 


100131 


D12485 


Hs.11951 


phosphodiesterase 1/nuoleotide pyrophosp 


8.7 


100137 


D13627 


Hs.15071 


chaperonin containing TCP1; subunit 8 (t 


9.5 


100144 


D13643 


Hs.75616 


Human mRNA for K1AA0018 gene; comp 


6 


100147 


D13666 


Hs.136348 


osteoblast specific factor 2 (fasciciin Hike 


8.5 


100154 


D14657 


Hs.81892 


KIAA0101 gene product 


10.5 


100164 


D14812 


Hs.173714 


MORF-related gene X 


4.6 


100169 


D14878 


Hs,82043 


D123gene product 


7.9 


100190 


D21090 


Hs.178658 


RAD23 (S. cerevisiae) homolog B 


5.6 


100203 


D25538 


Hs.172199 


adenylate cyclase 7 


9.9 


100209 


D26308 


Hs.76289 


bilrverdin reductase 8 (flavin reductase (N 


4.9 


100215 


026598 


Hs.82793 


proteasome (prosome; macropain) subunit 


14.2 


100216 


D26599 


Hs.1390 


proteasome (prosome; macropain) subunit 


11-3 


100219 


D28137 


Hs.118110 


bone marrow stromal cell antigen 2 


5.7 


100227 


D28915 


Hs.82316 


toterferon-induced; hepatitis C-assoclated 


5.7 


100248 


D31888 


Hs.78398 


KIAA0071 protein 


7.4 


100287 


D43950 


Hs.1600 


chaperonin containing TCP1 ; subunit 5 (e 


5.6 


100294 


D49395 


Hs.75454 


antioxidant protein 1 


12.9 


100307 


D50525 


Hs.699 


hypothetical protein 


8.4 


100335 


063391 


Hs.6793 


platelet-activating factor acetythydrolase; 


6.8 


100340 


D63487 


Hs.82563 


KIAA0153 protein 


4.4 


100355 


D78129 


Hs.71465 


Homo sapiens mRNA for squalene epoxid 


12.6 


100363 


078514 


Hs.78563 


ubiquitin-conjugating enzyme E2G 1 (horn 


4.6 


100368 


D79987 


Hs.153479 


extra spindle poles; S. cerevisiae; homolo 


6.5 


100372 


D79997 


Hs.184339 


K1AA0175 gene product 


8.4 


100375 


D80004 


Hs75909 


KIAA0182 protein 


4.5 


100379 


D82060 


Hs.278721 


Ke4 gene; mouse; human homolog of 


8.1 


100387 


D83777 


Hs.75137 


KIAA0193 gene product 


10.7 


100393 


084145 


Hs.39913 


novel RGD-containing protein 


72 


100398 


D84557 


Hs.155462 


minichramosome maintenance deficient (m 


72 


100405 


D86425 


Hs.82733 


nidogen2 


5.4 


100406 


D86479 


Hs.1 18397 


AE-binding protein 1 


4.3 


100409 


D86957 


Hs.80712 


K1AA0202 protein 


115 


100421 


D86985 


Hs.79276 


Human mRNA for WAA0232 gene; comp 


9.7 


100446 


D87464 


Hs.10037 


K1AA0274 gene product 


6.4 


100447 


D87465 


Hs.74583 


K1AAQ275 gene product 


10 


100448 


087469 


Hs.57652 


EGF^Klomain; multiple 2 


6.2 
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100467 
100468 
100486 
100497 
100618 
100661 
100667 
100668 
100676 
100775 
100783 
100829 
100830 
100840 
100350 
100866 
100906 
100914 
100916 
100945 
100975 
100988 
100996 



D89052 

089289 

HT1112 

HT1400 

HT2710 

HT3018 

HT3127 

HT3938 

HT3742 

HT26388 

HT4018 

HT4343 

HT4344 

HT4392 

HT417 

HT4582 

HT5158 

HT511 

HT544 

HT884 

J02923 

J03589 



101011 
101017 
101031 
.101038 
101054 
101061 
101091 
101097 
101104 
101143 
101152 
101183 
101216 
101233 
101247 
101282 
101326 
101332> 
101348 
101352 
101378 
101396 
101404 
101439 
101464 
101469 
101472 
101478 
101484 
101539 
101540 
101544 
101552 
101580 
101600 
101663 
101664 
101667 
101684 
101702 
101754 
101758 
101767 



J03934 
J04430 
J04599 
J05070 
J05249 
K02405 
K03515 
L06132 
L06797 
L07615 
L12723 
L13800 
L19779 
L25876 
L29008 
133801 
L38810 
L42572 
U7276 
L77213 
L77701 
M13755 
M15796 
M16342 
M20902 
M22538 
M22877 
M22960 
M23379 
M24594 
M30818 
M30938 
M31169 
M31642 
M34677 
M37583 
M60750 
M60752 
M60858 
M63256 
M64929 
M77142 
M77836 
M81057 



Hs.7476 
Hs.1 18722 
Hs.10842 
Hs.79137 
Hs.114599 
Hs.132748 
Hs.1 69610 
Hs.169610 
Hs.287820 
Hs.69603 
Hs.191356 
Hs.278544 
Hs.4756 
Hs.183418 
Hs.297939 
Hs.75113 
Hs.5398 
Hs.324178 
Hs.73946 
Hs.180686 
Hs.76506 
Hs.76480 
Hs.14623 
Hs.80706 
Hs.1211 
Hs.821 
Hs.151738 
HsJ9411 
Hs.73931 
Hs.180532 
Hs.149155 
Hs.89414 
Hs.169266 
Hs.90093 
' Hs.9884 
Hs.795 
Hs.84113 
Hs.878 
Hs.78802 
Hs.79387 
Hs.78504 
Hs.156346 
Hs.30954 
Hs.16297 
Hs.833 
Hs78996 
Hs.182447 
Hs.268571 
Hs.51299 
Hs.169248 
Hs.118126' 
Hs.758 
Hs.20315 
Hs.926 
Hs.84981 

Hs.82314 

Hs.83363 

Hs.119192 

Hs.2178 

Hs.121017 

Hs.79110 

Hs.75124 

Hs.179574 

Hs.239489 

Hs.79217 

Hs.180884 



ATPase; H+ transporting; lysosomal (vacu 
fucosyitransferase 8 (alpha (1;6) fucosyitr 
Ras-Like Protein Tc4 
Carboxyl Melhyltransferase, Aspartate, A 
Collagen, Type Vui, Alpha 1 



Epican, Alt Splice 11 

Epican, Alt Splice 12 

Fibronectin, Alt Splice 1 

Mucin 1, Epithelial, Ait Splice 9 

Basic Transcription Factor, 44 Kda Subun 

Cytosolic Acetoacetyl-Coenzyme A Thio 

Rad2 

Protein Kinase Pitslre, Alpha, Alt Splice 
Cathepsln B 
Transcription Factor Ifia 
Guanosine 5 , -Monophosphate Synthase 
Ras Inhibitor Inf 

Endothelial Cell Growth Factor 1 
Oncogene E6-Ap, Papillomavirus 
lymphocyte cytosoDc protein 1 (L-plastin 
ubiquitin-fike 4 

Interferon; gamma-inducible protein 30 
dlaphorase (NAOH/NADPH) (cytochrom 
acid phosphatase 5; tartrate resistant 
bigiycan 

matrix metalloproteinase 9 (gelatinase B; 
replication protein A2 (32kD) 
Human MHC class U HLA-DG-beta mRN 
glucose phosphate isomerase 
voltage-dependent anion channel 1 
chemokine (C-X-C motif); receptor 4 {fus 
Human neuropeptide Y receptor Y1 (NPY 
heat shock 70J(O protein 4 
Homo sapiens liver expressed protein gen 
H2A histone family; member 0 
cyclin-dependent kinase Inhibitor 3 (CDK 



glycogen synthase kinase 3 beta 
proteasome (prosome; macropain) 26S sub 
inner membrane protein; mftochondriaJ (m 
Homo sapiens (cell One HL-6) alpha topo 
phosphomevalonate kinase 
COX17 (yeast) homolog; cytochrome c ox 
tnterferon-stimulated protein; 15 kOa 
proliferating cell nudear antigen 
heterogeneous nuclear ribonucleoprotein C 
apolipoprotein C-l 

NADH dehydrogenase (ubiquinone) flavo 
Human somatic cytochrome c (HCS) gene 
. protective protein for beta-galactosidase ( 
RAS p21 protein activator (GTPase activa 
Interferon-induced protein 56 
myxovlrus (influenza) resistance 2; homol 
X-ray repair complementing defective rep 
Human proptonyl-CoA carboxylase beta-s 
hypoxanthlne phosphoribosyttransferase 1 
ONA segment on chromosome X (unique) 
H2A histone family; member Z 
H2B histone family; member A 
H2A histone family; member A 
nudeolin 

cerebellar degeneration-related protein (62 
protein phosphatase 2 (formerly 2A); regu 
TIA1 cytotoxic granule-associated RNA-b 
pyrrdine-5-carboxylate reductase 1 
carboxypeptidaseBI (tissue) 



7.5 
5 

16.9 

5.6 

7.5 

4.4 

4.6 

4.4 

9 

4.7 

13.7 

10.6 

5.5 

4.1 

4 

4.9 

8.7 

7.2 

5.9 

4.6 

30.1 

8.3 

6.9 

4.3 

5.9 

5.1 

Z72 

6.1 

4.3 

4.3 

7.4 

4,6 

18.3 

17.4 

7.6 

10.9 

7.4 

14.6 

7.5 

4.4 

5.8 

18.9 

7.5 

9.3 

18.1 

8.6 

4.5 

6.1 

8.7 

4.2 

6.5 

14 

9.2 

5.1 

4.7 

5.5 

8.5 

4.5 

5.7 

5.8 

13.5 

4 

7.6 
4.2 
4.5 
5.7 
21.7 
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101770 M81601 


Hs.78869 


transcription etongaGon factor A (SU); 1 


4,6 


101791 M83822 


Hs.62354 


cefl division cycle 4-tike 


97 


101803 M86546 


Hs.155691 


pre-B-cel) leukemia transcription factor 1 


5.5 


101809 M86849 


Hs.323733 


Homo sapiens connexm 26 (GJB2) mRNA 


22.5 


101839 M93036 


Hs.692 


membrane component; chromosomal 4; su 


4 


101851 M94250 


Hs.82045 


midkine (neurite growth-promoting factor 


7.6 


mm M99701 


Hs.95243 


transcription elongation factor A (SHHifce 


114 


101973 S82597 


Hs.80120 


UDP-N-acetyMpra-T>gaIactosamine:po 


4.6 


101991 U00966 


Hs.166 


Human SREBP-1 mRNA; complete cds 


4.1 


102009 U02680 


Hs.82643 


protein tyrosine kinase 9 


4.4 


102025 U03911 


Hs.78934 


mutS (E. cofi) homolog 2 (colon cancer; n 


4 


102047 U07158 


Hs.83734 


syntaxin 4A (placental) 


6.1 


102051 U07550 


Hs.1197 


heat shock 10kO protein 1 (chaperonin 10 


4.4 


102083 U10323 


Hs.75117 


interieukin enhancer binding factor 2; 45k 


10.4 


102095 U11313 


Hs.75760 


sterol carrier protein 2 


9.5 


102130 U15009 


Hs.1575 


small nuclear ribonudeoprotfiin D3 polyp 


6.6 


102133 U15173 


Hs.155596 


BCL2/adeno\rirus E1B 19kD-interacting p 


4.3 


102148 U16954 


Hs.75823 


ALLI-fused gene from chromosome 1q 


6.9 


102179 U19713 


Hs.76364 


allograft inflammatory factor 1 


4.8 


102180 U19718 


Hs.83551 


rnicro fibrilJar-associa ted protein 2 


7.2 


102193 U20758 


Hs.313 


secreted phosphoproteln 1 (osteopontin; b 


7.2 


mm V2WQ 


Hs.74598 


polymerase (DNA directed); defta 2; regu 


4.3 


102202 U21931 


Hs.574 


fructose-bisphosphatase 1 


4.5 


102209 U22970 


Hs.265827 


Interferon; alpha-friducibta protein (clone 


9.9 


102211 U23070 


Hs.78776 


putative transmembrane protein 


4.9 


102220 U24389 


Hs.65436 


lysyloxidase-fikel 


8.5 


102224 U24704 


Hs.148495 


proteasome (prosome; macropain) 26S sub 


5.4 


102234 U26312 


Hs.278554 


cnromobox homolog 3 {Drosophila HP1 g 


7.7 


102250 U28014 


Hs.74122 


caspase 4; apoptosis-related cysteine prol 


5.4 


102260 U28386 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1; impo 


6.3 


102261 U28488 


Hs.155935 


complement component 3a receptor 1 


5.7 


102273 U30888 


Hs.75981 


ubiquitin specific protease 14 (tRNA-guan 


6.1 


102298 U32849 


Hs.54483 


N-myc (and STAT) interactor 


4.1 


102302 U33052 


Hs,69171 


protein kinase C-like 2 


4.3 


102305 U33286 


Hs.90073 


chromosome segregation 1 (yeast homolo 


5.4 


102320 U34683 


Hs.82327 


glutathione synthetase 


4.1 


102330 U35451 


Hs.77254 


chromobox homolog 1 (Drosophila HP1 b 


4 


102348 U37519 


Hs.87539 


aldehyde dehydrogenase 8 


9.4 


102361 U39400 


Hs.75859 


chromosome 11 open reading frame 4 


5.2 


102362 U39412 


Hs.75932 


N-ethylmaleimide-sensitive factor attachm 


9.3 


102369 U39840 


Hs.299867 


hepatocyte nuclear (actor 3; alpha 


7.7 


102395 U41767 


Hs.92208 


a disintegrin and metalloproteinase domai 


10.4 


102409 U43286 


HS.U8725 


seienophosphate synthetase 2 


6.2 


102418 U43923 


Hs.79058 


suppressor of Ty (Sxerevisiae) 4 homolog 


4.1 


102425 U44772 


Hs.3873 


patmitoyi-protein thJoesterase (cerokM/po 


4.8 


102457 U48807 


Hs.2359 


dual specificity phosphatase 4 


6.3 


102465 U49352 


Hs.81548 


2;4-dienoyt CoA reductase 1; mitochondri 


9.4 


102495 U51240 


Hs.79356 


Lysosomal-assoctated multjspanning mem 


6.5 


102534 U56833 


Hs.198307 


von Hippet-Undau binding protein 1 


8.6 


102546 U57877 


Hs.3577 


succinate dehydrogenase complex; subuni 


4.3 


102549 U58046 


Hs.198899 


eukaryotic translation Initiation factor 3; s 


6.3 


102557 U58766 


Hs.264428 


tissue specific transplantation antigen P35 


5 


102562 U59309 


Hs.75653 


fumarate hydratase 


6 


102568 U59877 


Hs.223025 


RAB31; member RAS oncogene family 


9.1 


102580 U60808 


Hs.152981 


CDP-diacylglycerol synthase (phosphatid 


7.9 


102581 U61145 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2 


7.6 


102590 U62136 


Hs.79300 


Homo sapiens enterocyte differentiation a 


7 


102591 U62325 


Hs.324125 


amyloid beta (A4) precursor protein-blndi 


4 


102592 U62389 


Hs,11223 


Human putative cytosoOc NADP-depende 


5 


102617 U65928 


Hs.198767 


Jun activation domain binding protein 


6.1 


102618 U55932 


Hs.81071 


extracellular matrix protein 1 


23.2 


102638 U67319 


Hs.9216 


caspase 7; apoptosis-related cysteine prat 


8.9 


102663 U70322 


Hs.168075 


karyopherin (importin) beta 2 


7.1 


102666 U7066Q 


Hs.2799\0 


ATX1 (antioxidant protein 1; yeast) homo 


4.7 


102679 U72661 


Hs.11342 


ninjurin 1; nerve injury-induced protein-1 


4.7 


102687 U73379 


Hs.93002 


ubiquitin carrier protein E2-C 


7.7 


102704 U76638 


Hs.54089 


BRCA1 associated RING domain 1 


5.6 
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102907 
102919 
102929 
102973 
102983 
102985 
103003 
103018 
103023 
103036 
103060 
103073 
103075 
103080 
103094 
103105 
103121 
103149 
103180 
103182 
103188 
103191 
103193 
103194 
103195 
103206 
103207 
103208 
103216 
103226 
103230 
103262 
103278 
103297 
103302 
103316 
103330 
103349 
103352 
103364 
103374 
103380 
103395 
103402 
103410 
103420 
103421 
103427 
103430 
103438 



U77180 Hs.50002 small inducible cytokine subfamily A (Cy 

U79241 . Hs.118666 Human clone 23759 mRNA; partial cds 

U79254 Hs.181311 asparaginyMRNA synthetase 

U79282 Hs.155572 Human done 23801 mRNA sequence 

U79293 Hs.159264 Human done 23948 mRNA sequence 

U82130 Hs.118910 tumor susceptibility gene 101 

U86602 Hs.74407 nudeolar protein p40 

U87269 Hs.154196 E4F transcription factor 1 

U89606 Hs.38041 pyridoxal (pyridoxine; vitamin B6) kinase 

U90426 Hs.179606 nuclear RNA heBcase; OECO variant of 0 

U90904 Hs.83724 Human done 23773 mRNA sequence 

U90914 Hs.5057 carboxypeptidase D 

U91 327 Hs.6456 chaperonin containing TCP1; subunit 2 (b 

U94592 Hs.60658 Human uncoupling protein homolog (UCP 

U95006 Hs.37616 Human 09 splice variant B mRNA; comp 

U961 13 Hs.324275 Homo sapiens Nedd-4-like ubiquitin-prot 

X02419 Hs.77274 plasminogen activator; urokinase 

X06985 Hs.202833 heme oxygenase (decydlng) 1 

X1 2447 aldolase A; fructose-bisphosphate 

X13238 Hs.74649 cytochrome c oxidase subunit Vic 

X1 6663 Hs. 1 4601 hematopoietic cell-specific Lyn substrate 

X17620 Hs.118638 non-metastatic cells 1; protein (NM23A) 

X17644 Hs.2707 G1 to S phase transition 1 

X52003 Hs. 1 406 trefoil factor 1 (breast cancer; estrogen-ind 

X53296 Hs.81 134 tnterteukin 1 receptor antagonist 

X53793 Hs.1 17950 multifunctional polypeptide similar to SA 

X54925 Hs.831 69 matrix metallopratelnase 1 (interstitial col 

X57766 Hs. 155324 matrix metalloproteinase 11 (stromelysin 

X59417 Hs.74077 proteasome (prosome; macropain) subunit 

X59543 Hs.2934 ribonucleotide reductase M1 polypeptide 

X59798 Hs.82932 cydin D1 (PRAD1: parathyroid adenomat 

X60787 Hs.296281 interteukln enhancer binding factor 1 

X61970 Hs76913 proteasome (prosome; macropain) subunit 

X63679 Hs.4147 translocating chain-assodating membrane 

X66363 Hs.171834 PCTAIRE protein kinase 1 

X69433 Hs.5337 Isodtrate dehydrogenase 2 (NADP+); mit 

X69819 Hs.99995 intercellular adhesion molecule 3 

X70040 Hs.2942 macrophage stimulating 1 receptor (omet 

X70218 Hs.2903 protein phosphatase 4 (formally X); cataly 

X70476 Hs.75724 coatomer protein complex; subunit beta 2 

X70649 Hs.78580 DEAD/H (Asp-Ghi-Ala-Asp/His) box pd 

X70940 Hs.2642 eukaryotic translation elongation factor 1 

X72755 Hs.77367 monokine induced by gamma interferon 

X72790 Human endogenous retrovirus mRNA for 

X72841 Hs.31314 retinoblastoma-bindlng protein 7 

X74262 Hs. 16003 retinoblastoma-btrtding protein 4 

X75042 Hs.4431 3 v-rel avian reticulbendotheriosis viral onco 

X75861 Hs.74637 testis enhanced gene transcript 

X78565 Hs.289114 hexabrachlon (tenasdn C; cytotactin) 

X79882 Hs.80680 lung resistance-related protein 

X81788 Hs.9078 Immature colon carcinoma transcript 1 

X821 03 Ks.3059 coatomer protein complex; subunit beta 

X83301 Hs.324728 SMA5 

X85373 Hs.77496 small nudear ribonudeoprotein polypepti 

X89059 serine/threonine kinase 9 

X89398 Hs.78853 uracB-DNA glycosylase 

X90872 Hs.279929 SULTICsulfotransferase 

X91788 Hs.84974 chloride channel; nudeotide-sensitlve; 1A 

X92396 Hs.24167 synaptobrevin-tte 1 

X94754 Hs.279946 methionlne-tRNA synthetase 

X95404 Hs.180370 cofilin 1 (non-musde) 

X96506 Hs.295362 DR 1 -assodated protein 1 (negative cofact 

X97065 Hs.173497 Sec23 (S. cerevisiae) homolog B 

X97074 Hs. 1 19591 adaptor-related protein complex 2; sigma 

X97303 Hs.250655 H.sapiens mRNA for Ptg-12 protein 

X97544 Hs.207 16 translocase of Inner mitochondrial membr 

X98263 Hs.152720 M-phase phosphoproteln 6 
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103464 Y00285 Hs.76473 


insulin-like growth factor 2 receptor 


4.2 


103470 Y00796 Hs.174103 


integrin; alpha L {antigen CD1 1A (p180); 


4.5 


103434 Y08991 Hs.83050 


phosphatidyBnosltoJ 3-kinase-associated p 


4.1 


103505 Y09912 Hs.33102 


transcription factor AP-2 beta (activating 


4.5 


103547 Z14982 Hs.180062 


proteasome (prosome; macropain) subunit 


4.3 


103551 Z15115 Hs.75248 


topoisomerase (DNA) 11 beta (180kD) 


4 


103565 Z22548 Hs.146354 


thtoredoxkwiependent peroxide reductase 


7.6 


103587 Z29083 Hs.82128 


514 oncofetal trophoblast glycoprotein 


14.6 


103621 Z47727 Hs.150675 


polymerase (RNA) ii (DNA directed) pol 


6.3 


103522 Z48042 Hs.278672 


membrane component chromosome 1 1 ; s 


4.4 


103658 Z74615 Hs.172928 


collagen; type 1; alpha 1 


5.9 


103680 Z93784 


Homo sapiens DMA sequence from PAC 


4.4 


103772 AA092473 Hs.278554 


chromobox homolog 3 (Drosophila HP1 g 


4.9 


103774 AA092898 Hs.92918 


ESTs; Weakly similar to R07G3.8 [C.eleg 


6.1 


103821 AA157623 Hs.198793 


KIAA0750gene product 


23.3 


103835 AA172215 Hs.93748 


ESTs; Moderately similar to TRANSCRIP 


4 


103886 AA236384 Hs.105737 


ESTs; WeaWy similar to gene 9306 protei 


4.9 


103890 AA236843 Hs.72085 


ESTs; Weakly similar to unknown [S.cere 


7.8 


103892 AA243523 Hs.239189 


ESTs 


4.8 


104054 AA393432 Hs.7100 


hypothetical protein 


5.3 


104115 AA428090 Hs.26102 


ESTs 


28.7 


104136 AA442669 Hs.268371 


zv68f6.r1 SoaresJotaLfetus_Nb2HF8_9w 


5.7 


104147 AA451992 Hs.283037 


ESTs; Highly similar to HSPC039 protein 


6.9 


104173 AA476564 Hs.76561 


ESTs; Weakly similar to ringer protein HZ 


5.2 


104181 AA479521 Hs.283740 


ESTs 


7.8 


104183 AA480838 Hs.1 14309 


ESTs 


5.1 


104192 AA486946 Ks.21321 


Homo sapiens mRNA; cDMA DKFZp564 


4.3 


104209 AB000221 Hs.16530 


small Inducible cytokine subfamily A (Cy 


113 


104234 AB002357 Hs.168212 


kinesin family member 3B 


6.2 


104271 C01687 Hs.7381 


ATP synthase; H+ transporting; mitochon 


4.2 


104278 C02582 Hs.109253 


EST s; Highly similar to N-terminal acetyl 


4.5 


104307 D52818 Hs.111680 


endosulfine alpha 


4.7 


104309 D55869 Hs.284123 


Homo sapiens mRNA full length insert cD 


4.2 


104370 H19378 Hs.21851 


Homo sapiens mRNA; cDNA DKFZp586 


6.4 


104446 L44497 Hs.7351 


ESTs 


4.9 


104453 M19169 Hs.123114 


cystatin SN 


11.6 


104476 N33807 Hs.324275 


protease; serine; 15 


5.6 


104558 R56678 Hs.88959 


Human DNA sequence from done 967N2 


6.3 


104592 R81003 Hs.325820 


serine protease; umbilical endothelium 


13.6 


104634 AA004274 Hs.19151 


ESTs 


6.3 


104636 AA004415 Hs.106106 


ESTs 


10.1 


104658 AA007145 Hs.27268 


Homo sapiens mRNA; cONA DKFZp564 


4.3 


104667 AA007234 Hs.30G98 


ESTs 


16.6 


104675 AA009596 Hs.301553 


ESTs; Moderately similar to till ALU SU 


4.6 


104767 AA025534 Hs.8852 


ESTs 


4.8 


104785 AA027163 Hs.7942 


ESTs 


8.1 


104791 AA029046 Hs.301871 


ESTs; Moderately similar to cAMP indue 


10.9 


104604 AA031357 Hs.31803 


ESTs; WeaMy similar to N-WASP [H.sap 


5.5 


104807 AA032147 Hs.23296 


ESTs 


10.4 


104837 AA039469 Hs.21126 


ESTs; Weakly similar to K1AA0299 {H.s 


4.6 


104849 AA040270 Hs,241507 


Homo sapiens mRNA; cDNA DKFZp564 


4.3 


104867 AA045481 Hs.225979 


Human gene from PACs 37M17 and 305B 


4.5 


104884 AA053021 Hs.14511 


SCO (cytochrome oxidase deficient; yeast 


4.7 


104906 AA055809 Hs.26802 


ESTs; Weakly similar to phosphoprotein { 


8.8 


104919 AA057193 Hs.25252 


ESTs 


5.5 


104921 AA057839 Hs.1508 


ESTs 


4.2 


104926 M058846 Hs.33363 


DKFZP434N093 protein 


7 


104938 AA064627 Hs.318725 


ESTs; Highly similar to CGI-72 protein [H 


7.1 


104943 AA065217 Hs.114218 


ESTs 


5.7 


104957 AA074919 Hs.10026 


ESTs; Weakly similar to ORF YJL063c [S 


4.7 


104961 AA076672 Hs.33905 


ESTs 


5.5 


104968 AA084602 Hs.29669 


ESTs 


4.3 


104975 AA086071 Hs.50758 


chromosome-associated polypeptide C 


8.3 


104977 AA088228 Hs.18272 


ESTs 


6.2 


104978 AA088458 Hs.19322 


ESTs 


6.7 


104987 AA101723 Hs.11861 


ESTs 


9.2 


105002 AA1 13266 Hs.182704 


ESTs; Moderately similar to alternatively 


6.9 



219 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



105012 AA1 16036 


Hs9329 


rhmmncAmp 90 nnpn rPflHlnn framp 1 


107 

lv.» 


10R01Q AA191A70 
ivoui? MAlllO'o 


He P9A0 


piuicaounib ipiUbumt;, induupdinj suDunii 


R 7 


105029 AA1 26855 


Hs 13268 


ESTs 


4.4 


105011 AA127Q64 
iujujo AAIt/3UT 


Hs 274329 


TP*i1 tamot nana 1 


fi 1 

0.0 


10501*1 AA19A4A6 


Hs AARQ 
no. 0009 


P<iTe 
Colo 


A R 
0.0 


iosoiq aahoiaq 

l U OUOo MA 1 ouOH J 


He 1R47R 


PQTe 
COl 5 


A 

H 


105062 AA1 34968 


Hs 36529 


P9Ts 


4 1 


105076 AA142A<ift 

IUJU/Q AAlt&OJO 


He 17AM 


P<?Te 

COI3 


RA 

0.*t 


105087 AA147884 


Hs.9812 


ESTs 


Q2 


105091 AA1 48859 


Hs 179909 
rw. 1 1 33V3 


FSTs* Wpakfv similar tn till Al 1 1 <?1 IRFA 
coi o, t ¥ cdMy auiiuaf iu iiii alu ouo~a 


R 7 
o./ 


10*00.1 AA14Q0R1 
1 UOUoO MA I *toU0 I 


He 19A0R 


co IS 


0.0 


105107 AA152302 


Hs 25035 
no. £ouoo 


DKF7P5fififi991 nrnfprn 


A 9 


MR137 AA1RA119 


He 1MQR7 
no.OuloO/ 


co i s, weaKiy similar io cuniatns sirnuan 


7 


105132 AA159501 


Hs 247280 


HRV aeenrialprl inritv 


4.2 


10R1A1 AA1RR111 
IU0 ISO MM 100000 


Hq 9AR0A 


Colo 


A 7 


10R1R4 AA171716 

IVW IO 4 * MA I / I / CO 


H<! 1RQ47 
no.ooo**/ 


IllcUiyrV/pU UulUlIly UUIIldlN piQlcUl *t 


Q 
3 




He 4rtfti 

ns.4Uo4 


ftiMMiuzo pfoiein 


O 1 


lUolOO AAlblOli: 


nS.ZoUUo 


nomo sapiens mKNA, cuna uixrzpoM 


4Q 0 

1b\o 


-inROftq A A90RO79 

1 UOcU-j MAZUOU/£ 


He 997741 


klAAOOAn nmloln 
rviAAUoou pnjttiui 


7 4 


4AR901 AAOI-IIQQ 

lUOiiiO aa^iiooo 


MS./ /oU 


ColS 


5.1 


inCOCO AA007/10 


Up 07OQ 

ns.y/iio 


tbis, WeaKJy similar to K1AAQ512 prote 


11,1 


lAOUl AA007VMR 
1U0O0 AA^/440 


Ue 

nS.OUUo 


MAMiwoo pnoiem 


0.4 


10R3R1 AA997R71 


He R1R1 


Mtis panner i 






ns.oooZ 


CCTe 

tolS 


C 7 


10R97A AA99R199 


He 9R1RRR 


ATPase,* H+ l/aosporfing,* lysosomal (vacu 


0.0 


10R9Q7 AA911451 
IU0C97 MMZ00*i0 1 


He IRIR^fl 


uanscnpuonai inifiuneaiary lauor l 


ft 7 
O./ 


IfKlftO AAO*JT70n 
lUOJUy AAZoJ/bU 


nS.s IIM 


co I s, weawy similar to cuna co I yoj 




10R119 AA911AR4 
IUOOI£ MAc000O4 


Ue 9114ft 


Q nhoeo l/'mooo nccnr^afarl nrnlain 0 /n/^\ 

o-piwioc rundbe-ossoudlea pioiein ^ yJ'+o; 


C Q 
0.0 


IflRIAO AA9T.R0RR 
IU004Z MAZOO^OO 


He 4R7n7R 

ns.ia/U/o 


PQTc 

col s 


A C 


10R17R AA91RRRQ 
IUOJ/0 AAlODOOo 


He R7RR 

ns.o/Do 


PQTe> Woalh* cJmibr tn (III Al 1 1 CI IRPA 

Co i s, weaKry simtiar io mi mlu oUBrM 


R R 
0.0 


10R1RR AA91RQR0 
luOOOO AAZOOoOU 


He R11R 


PQTc 
Col S 


0.0 


1051Q7 AA942RR8 


Hs 71Q6 
no./ 030 


PCTe* WpaHu similar tn hnneaJropninn n 


7.7 


1051QQ AA941007 
lujoyy aazhouu/ 


He 1R490 


PCTe- Hlnhlu eimllar In QH1 Hnmain_htnri 

coio, niymy buiiikii io ono uornain-Dinu 


R R 
0.0 


inR^nn AA94irtR9 

lUOIUU AAZ40U0Z 


He RRfitlfl 
nS.uOO'tO 


RNA bindiog motif protein 6 


R Q 
0.0 


106404 AA941101 


Hs 911R7 
nox I lot 


CCTe 
COIO 


Q 1 

3. 1 


IIV^riQ AA941RR9 
lUOIUy AAZ4000Z 


Ue IfMflRC 
tlS.JU J ODD 


ColS 




10R41R AA9R9179 


He 917A^fi 
nb. 40/000 


CO 1 o, MUOcialUiy SUTlual \U WMIW UJUUC 


R 1 

0. 1 


1U0 *5o0 AAZDOO/4 


He 97/1 Cfl 

ns^osoo 


CCTe 
CO IS 


4.y 


106491 AA966968 


He. 109A3 


FCTs 

COI O 


o 


IflR/QG AA0RRH7 
lUDIaD MM^OOOI/ 


He* 9Q70R 


nomo sapiens mKNA, cuna Uf\rzpooo 


o.z 


10R4QR AA9RR191 
MMZOOO^O 


He 101 QQ7 

nb.ou i as/ 


ut\TLr l *o i kii\£X> pro 16 in 


A 7 

o.r 


4 akkra aaorrarr 
IUODUU MAzoMOO 


He 900700 


uoi*bo protein 


O R 

b.o 


10RR07 AA9RRA7A 
1 UOOU/ AAZOOO * O 


He 99R11A 
no.cioo to 


co i o, ivioucraiciy oiiniidr w V/W»r\*i-dooua 


4 1 


lUOoOO aaZOOOOU 


He 90i;Q7 


nng linger protein (uorizu J type) o 


4.1 


IftRRAA AA9R1QR4 
I VdDVi MMZOlsOH 


He 94R7A 


PCTc 
Colo 


0 

0 


105546 AA2o^OJz 


OCDOIH 

HS^:bo/:o1 


rr»T., i»r_ i_ conn - rr\ 

cois, Weakly simuar to 62D9.a [D.melan 


0.1 


IflCG/ia AAOR9417 
11)0043 AAZ0c4U 


He <;A1R 


CO IS 


A R 
4.0 


4nCCC4 AAOCOA77 

IUooo 1 PduvtHi t 


He 9R000 


ribonuclease HI; large subunit 


y.i 


105560 AA262783 


Hs.306915 


ESTs 


4.5 


105565 AA278302 


Hs.18349 


ESTs; Weakly similar to partial COS [C.e 


4^2 


105566 AA278323 


Hs.17481 


Homo sapiens clone 24606 mRNA sequen 


11.9 


105575 AA278717 


Hs.12772 


ESTs 


5.9 


105584 AA279012 


Hs.3454 


ESTs; Weakly similar to KIAA0665 prote 


4.4 


105596 AA279418 


Hs.18490 


ESTs 


4 


105604 AA279787 


Hs.15467 


ESTs; Moderately similar to putative pho 


5.6 


105610 AA279991 


Hs.99872 


ESTs; Weakly similar to trithorax homoto 


5.3 


105621 AA280865 


Hs.6375 


Homo sapiens mRNA; cDMA DKFZp564 


4.8 


105627 AA281245 


Hs^3317 


ESTs 


7.5 


105638 AA281599 


Hs.247817 


Homo sapiens mRNA for for histone H2B 


5.9 


105645 AA282138 


Hs.11325 


ESTs 


6.4 


105650 AA282347 


Hs.25635 


ESTs; Highly similar to HSPC003 (H.sap 


11.3 


105666 AA283930 


Hs.34906 


ESTs 


4.7 


105674 AA284755 


Hs.279789 


CDW52 antigen (CAMPATH-1 antigen) 


8 


105687 AA286809 


Hs.28423 


ESTs 


7.1 


105700 AA287643 


Hs.35254 


ESTs; Weakly similar to hypothetical pro 


4.9 


105705 AA290767 


Hs.101282 


Homo sapiens mRNA; cDNA OKFZp434 


8 


105709 AA291268 


Hs.26761 


OKFZP586L0724 protein 


6.8 
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105731 AA292711 Hs.29131 ESTs 

105753 AA299789 Hs.1 10857 ESTs 

105774 AA348014 Hs.23412 ESTs 

105784 AA350771 Hs.1785Q ESTs 

5 105791 AA358038 Hs.14368 SH3-bindmg domain glutamic acid-rich p 

105807 AA393803 Hs.16869 ESTs; Moderately similar to COLLAGEN 

105808 AA393808 Hs.286131 KIAA0438 gene product 

105812 AA394126 Hs.20814 ESTs; Highly similar to CGI-27 protem [H 

105813 AA394140 Hs.18585 ESTs 

10 105819 AA397920 Hs.28783 Homo sapiens mRNA; cONA DKF2p564 

105870 AA399623 Hs.101067 ESTs 

105874 AA400074 Hs.171118 ESTs 

105896 AA400999 Hs.7838 Human ring zinc-finger protein (ZNF127- 

105934 AA404248 Hs.16577 ESTs 

15 105935 AA404277 Hs.263727 ESTs; Weakly similar to blsphosphate 3'- 

105966 AA406105 Hs.5344 adaptor-related protein complex 1; gamma 

105974 AA406321 Hs.6224 KIAA0895 protein 

105990 AA410336 Hs.29403 ESTs; Weakly similar to PROBABLE AT 

105995 AA410510 Hs.5345 ESTs 

20 106000 AA410972 Hs.20726 ESTs 

106007 AA411462 Hs.11042 ESTs; Weakly similar to veli 1 [H .sapiens 

106016 AA411819 Hs.8164 K1AA0898 protein 

106034 AA412473 Hs.14928 ESTs 

106042 AA412700 Hs.169895 ubquitin-conjugating enzyme E2L 6 

25 106057 AA417067 Hs.289074 ESTs 

106065 AA417558 Hs.25206 ESTs 

106070 AA417761 Hs.5957 Homo sapiens done 24416 mRNA sequen 

105103 AA421104 Hs.12094 ESTs 

106126 AA424006 Hs.22972 ESTs; Moderately similar to H5AR [M.m 

30 106154 AA425304 Hs.6994 ESTs 

106157 AA425367 Hs.34892 ESTs 

106166 AA425872 Hs.19561 NADH dehydrogenase {ubiquinone) 1 alp 

106204 AA428024 Hs.21479 ESTs 

106210 AA428239 Hs.10338 ESTs 

35 106220 AA428582 Hs.32196 ESTs; Moderately similar to metargidin p 

106236 AA429951 Hs.21104 ESTs 

106240 AA430074 Hs.18552 ESTs; Weakly similar to Ylr218cp [S.cere 

106263 AA431462 Hs.28329 ESTs 

106288 AA435536 Hs.24336 ESTs 

40 106293 AA435591 Hs.301444 signal sequence receptor; gamma (transloc 

106310 AA436244 Hs.17240 ESTs 

106317 AA436568 Hs.108124 ESTs 

106328 AA436705 Hs.28020 K1AA0766 gene product 

106341 AA441798 Hs.5243 ESTs; Moderately similar to plL2 hypoth 

45 106348 AA442253 Hs.10702 ESTs 

106350 AA442763 Hs.194698 cydinB2 

106371 AA443923 Hs.170310 ESTs 

106389 AA446949 Hs.6236 ESTs 

106394 AA447223 Hs.25320 Homo sapiens done 25142 mRNA sequen 

50 106426 AA448282 Hs.16206 ESTs; Weakly similar to F55C12.5 [Cele 

106459 AA449741 Hs.4029 gfioma^mplified sequence-41 

106462 AA449912 Hs.30532 ESTs; Highly similar to CGl-77 protein |H 

106468 AA450047 Hs.14770 ESTs 

106479 AA450351 Hs.75251 ESTs 

55 106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 

106503 AA452411 Hs29679 ESTs; Highly similar to mediator [H-saple 

106507 AA452584 Hs267819 protein phosphatase 1; regulatory (inhibito 

106533 AA453786 Hs.145998 ESTs 

106568 AA455970 Hs.28285 patched related protein translocated in ren 

60 106586 AA456598 Hs,57787 ESTs 

106589 AA456646 Hs28661 ESTs 

106606 AA457730 Hs283437 Homo sapiens done 23851 mRNA sequen 

106611 AA458904 Hs.26267 ESTs; Weakly similar to torsinA [H.sapie 

106614 AA458934 Hs256150 ESTs 

65 106628 AA459657 Hs.12311 Homo sapiens done 23570 mRNA sequen 

106637 AA459961 Hs.250824 ESTs 

106644 AA460239 Hs.12680 ESTs 
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106664 
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108676 
65 108687 
108695 
108733 



106919 
106920 
106941 
106942 
106948 
106968 
106973 
106980 
106981 
106998 
107008 
107028 
107032 
107052 
107053 
107056 
107080 
107102 
107109 
107129 
107132 
107136 
107140 
107151 
107159 
107174 
107217 
107252 
107295 
107299 
107324 
107372 
107373 
107481 
107531 
107859 
107690 
107908 
108039 
108040 
108102 
108217 
108255 
108358 



AA460969 Ks.7510 mitogen-activated protein kinase kinase kl 

AA463745 Hs.29403 ESTs; Weakly similar to PROBABLE AT 

AA465171 Hs.236844 ESTs 

AA465339 Hs.3886 ESTs 

AA476473 Hs.171957 triple functional domain (PTPRF interact! 

AA477263 Hs.25584 ESTs 

AA477717 Hs.306117 interleukin 13 receptor; alpha 1 

AA478558 Hs.227913 APl5-Gke1 

AM82014 Hs.29463 centrin; Ef -hand protein; 3 (CDC31 yeast 

AA482112 Hs.238707 ESTs 

AA482548 Hs.5534 ESTs 

AA486183 Hs.285123 ESTs; Weakfy similar to similar to oxyste 

AA487228 Hs.19479 ESTs 

AA488872 Hs.12314 Homo sapiens mRNA; cDNA DKFZp586 

AA489101 Hs.24734 oxysterol binding protein 

AA489665 Hs.25245 ESTs 

AA490323 Hs.250747 SUMO-1 activating enzyme subunlt 1 

AA490885 Hs.21766 ESTs 

AA490899 Hs.296323 ESTs 

AA496204 Hs.237971 ESTs 

AA496347 Hs.31314 retinoblastoma-blnding protein 7 

AA496788 Hs.21077 KIAA0532 protein 

AA504631 Hs.26813 ESTs; Weakly similar to hypothetical 43.2 

AA505141 Hs.11923 Human ONA sequence from done 167A1 

AA521121 Hs.8858 bromodomain adjacent to zinc finger dom 

AA521157 Hs.74101 ESTs 

AA598461 Hs.195464 insulin-tike growth factor binding protein 

AA598710 Hs.23740 ESTs 

AA599214 Hs.24143 ESTs 

AA599472 Hs.247309 sucdnate-CoA Ugase; GDP-forming; beta 

AA600134 Hs. 12482 glyceronephosphateO-acyltransferase 

AA600147 Hs.5741 ESTs; Weakly similar to NADH-cytochro 

AA600310 Hs.18720 programmed cefl death 8 (apoptosls-induc 

AA609210 Hs.19221 ESTs 

M609723 Hs.30652 ESTs 

AA609943 Hs.32793 ESTs 

AA620553 Hs.4756 flap structure-specific endonuclease 1 

AA620598 Hs.9052 ESTs 

AA620795 Hs,8207 ESTs 

AA620889 Hs.170088 ESTs 

AA621169 Hs.8687 ESTs 

AA621340 Hs.10600 ESTs; Weakly similar to ORF YKROSIc [ 

AA621714 Hs.25338 ESTs 

051095 Hs.35861 DKF2P586E1621 protein 

059971 Hs.25925 ESTs 

T34527 Hs,80120 UDP-N-acetyl-alpha-D-galactosamlne:po 

T40327 Hs.30661 lung resistance-related protein 

T81665 Hs.278422 DKFZP586G1 122 protein 

U85625 Hs.8297 ribonuclease 6 precursor 

U85773 Hs. 154695 phosphomannomutase 2 

W58247 Hs.279766 Homo sapiens kinesin superfamily motor 

Y13936 Hs.17883 protein phosphatase 1G (formerly 2C); ma 

AA024835 Hs.47584 potassium voltage-gated channel; delayed 

AA026030 Hs.6131 1 ESTs; Weakly similar to CALPAIN 2; LA 

AA026894 Hs.42826 ESTs 

AA041341 Hs.46670 ESTs 

AA041551 Hs.159971 ESTs 

AA046424 Hs.49433 ESTs; Weakly similar to HYPOTHETICA 

AA058686 Hs.62588 ESTs 

AA063157 Hs.172608 ESTs 

AA071514 Hs.1634 ESTs 

AA100694 Hs,69499 Human ONA sequence from BAC 15E1 o 

AA112396 Hs.44276 ESTs; Moderately similar to HOMEOBO 

AA1 15562 Hs.274417 Homo sapiens mRNA; cONA DKFZp564 

AA120785 Hs.54347 ESTs 

AA121315 H3.70323 KIAA 1077 protein 

AA1 26422 znS4f1 .si Stratagene lung carcinoma 9372 
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108774 AA128125 Hs.71040 ESTs; Moderately similar to CELL GROW 

108828 AA131584 Hs.273344 DKFZP564O0463 protein 

108872 AA134063 Hs.1 11680 ESTs 

108884 AA134958 Hs.293591 ESTs 

5 108893 AA135894 Hs.194691 retinoic acid Induced 3 

109008 AA156360 Hs.87128 ESTs 

109010 AA155460 Hs.44229 dual spedficfty phosphatase 12 

109011 AA156542 HsJ2127 ESTs 

109042 AA159525 Hs.71779 Homo sapiens DNA from chromosome 19 

10 109085 AA166695 Hs^70737 tumor necrosis factor (ligand) superfamily 

109090 AA167006 Hs.70499 ESTs 

109101 AA167708 Ks.52184 ESTs 

109112 AA169379 Hs.257924 ESTs 

109160 AA179387 Hs.301997 DKFZP434N 126 protein 

15 109166 AA179845 Hs.73625 RAB6 interacting; Wnesin-BKe (rabkinesin 

109178 AA181600 Hs.283707 ESTs 

109179 AA181902 Hs.192789 ESTs; Weakly similar to !!H ALU SUBFA 
109261 AA195255 Hs.61779 ESTs 

109270 AA195515 Hs.3585 ESTs; Weakly similar to alternatively spli 

20 109277 AA196332 Hs.86043 ESTs 

109313 AA206800 Hs.86276 ESTs; Moderately similar to zinc finger p 

109415 AA227219 Hs.1 10826 trinucleotide repeat containing 9 

109454 AA232255 Hs.295232 ESTs 

109467 AA232904 Hs.63187 ESTs 

25 109481 AA233342 Hs.289069 ESTs; Weakly similar to WD40 protein C 

109508 AA233892 Hs.55902 ESTs; WeaWy similar to III! ALU SUBFA 

109514 AA234087 Hs,262346 ESTs; Weakly similar to ORF2: function 

109572 F02027 Hs.171937 ESTs 

109632 F04165 Hs.235873 ESTs; Weakly similar to K1 1C4.2 [C.eleg 

30 109644 F04477 Hs.291531 ESTs; Moderately similar to GLYCERAL 

109703 F09684 Hs.24792 ESTs; Weakly similar to ORF YOR283W 

109726 F10009 Hs.9196 ESTs 

109747 F10161 Hs.22969 ESTs 

109799 F10770 Hs.180378 Homo sapiens done 669 unknown mRNA 

35 109814 F10979 Hs.153106 Homo sapiens clone 23728 mRNA sequen 

110189 H20543 Hs.6278 OKF2P586B1 621 protein 

110240 H25577 Hs.176588 ESTs; Weakly similar to CYTOCHROME 

110280 H29285 Hs.32468 ESTs 

110520 H56965 Hs.4082 yr09f06.s1 Scares fetal fiver spleen 1NFL 

40 110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CO 

110707 H95079 Hs.15617 ESTs; Weakly similar to IUI ALU SUBFA 

110734 H98714 Hs.24131 ESTs 

110770 N22262 Hs.131705 ESTs 

110780 N23174 Hs.22891 solute carrier family 7 (cationic amino aci 

45 110787 N24716 Hs.12244 ESTs; Weakly similar to C44B9.1 (C.eleg 

110794 N25262 Hs.27931 ESTs 

110799 N26101 Hs.323401 Human ring zinc-finger protein (ZNF127- 

110818 N29454 Hs.27552 ESTs; Weakly similar to putative p150 |H 

110839 N30856 Hs.30246 solute carrier family 19 (thiamine transpo 

50 110844 N31952 Hs.167531 Homo sapiens mRNA full length Insert cD 

110854 N32919 Hs.27931 ESTs 

110856 N33063 ESTs; Weakly similar to S164 (Rsapiens 

110860 N33438 Hs.170065 ESTs 

110897 N39148 Hs.6880 DKFZP434D1 56 protein 
55 110915 N46252 Hs.29724 ESTs 

110935 N48787 Hs.305979 protease inhibitor 1 (anti-elastase); alpha- 

110970 N51374 Hs.96870 Homo sapiens mRNA full length Insert cD 

111006 N53375 Hs.166146 Homer, neuronal immediate early gene; 3 

111008 N53388 Hs.7222 ESTs 
60 111018 N54067 Hs.3628 mitogen-activated protein kinase kinase ki 

111084 N59543 Hs.15456 PD2 domain containing 1 

111100 N62522 Hs.20450 ESTs 

111125 N63823 Hs.269115 ESTs 

111132 N64378 Hs.83293 ESTs; Highly similar to unknown function 
65 111139 N64683 Hs.290943 ESTs 

111164 N56857 Hs.14808 ESTs; WeaWy similar to III! ALU CLASS 
111172 N67102 Hs.21851 Homo sapiens mRNA; cONA DKFZp586 
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111178 M67227 


Hs.24633 


ESTs 


5.7 


111179 N67239 


Hs.10760 


ESTs 


37 


111181 N67278 


Hs. 171802 


ESTs; Weakly similar to hypothetical pro 


5.6 


111184 N67437 


Hs.243901 


Homo sapiens mRNA; cDNA DKFZp564 


8.7 


111221 N68869 


Hs.15119 


ESTs 


7.3 


111223 N68921 


Hs.297939 


ESTs; Weakly similar to neogenin (H.sap 


9 


111229 N69113 


Hs.1 10855 


ESTs 


8.9 


111241 N69514 


Hs.288880 


ESTs; Weakly similar to CGI-82 protein ( 


6.9 


111268 N70481 


Hs.26118 


Homo sapiens clone 24766 mRNA sequen 


4.5 


111295 N73275 


Hs.21275 


ESTs; Weakly similar to ublquitin-conjug 


5.6 


111299 N73808 


Hs.24936 


ESTS 


8.5 


111336 N79565 


Hs.29894 


ESTs 


6.7 


111357 N91023 


Hs.87128 


ESTs 


15 


111370 N92915 


Hs.94631 


brefeldin A-inhibited guanine nudeotide-e 


5.2 


111806 R33468 


Hs.279008 


ESTs 


10 


111825 R35885 


Hs.286148 


stromal antigen 1 


4.5 


111836 R36228 


Hs.25119 


ESTs 


7.2 


111890 R38678 


Hs.12365 


ESTs 


17.3 


111923 R39995 


Hs.25925 


Homo sapiens done 23860 mRNA sequen 


7.3 


111942 R40576 


Hs.21590 


ESTs 


9.2 


111987 R42036 


Hs.6763 


KIAA0942 protein 


10.6 


112101 R44793 


Hs.296341 


adenylyl cydase-associated protein 2 


5.3 


112134 R46025 


Hs.7413 


ESTs 


17.4 


112197 R49482 


Hs.5637 


ESTs 


4.4 


112244 R51309 


Hs.70823 


KIAA1077 protein 


11 


112253 R51818 




Homo sapiens mRNA; cDNA OKFZp566 


9.3 


112305 R54822 


Hs.26244 


ESTs 


4.4 


112449 R63802 


Hs.124186 


ring finger protein 2 


6.3 


112483 R66534 


Hs.285885 


ESTs 


4.9 


112519 R68631 


Hs.11861 


ESTs 


14.3 


112610 R79392 


Hs.23643 


ESTs 


5.2 


112693 R88741 


Hs.91065 


ESTs; Moderately similar to proliferation 


4.6 


112751 R93507 


Hs.8207 


ESTs 


5.6 


112801 R97486 


Hs.157160 


protein kinase; ONA-activated; catalytic p 


8.7 


112869 T03313 


Hs.4747 


dyskeratosis congenita 1; dyskerin 


5.9 


112871 T03352 


Hs.12285 


ESTs 


5.8 


112908 T10065 


Hs.3530 


TLS-assodated serine-arginine protein 


4.1 


112966 T17119 


Hs.102548 


glucocorticoid receptor ONA binding fact 


5.7 


112971 T17185 


Hs.83883 


ESTs 


6.4 


112995 T23528 


Hs.7155 


ESTs; Weakly similar to TYKi protein (M 


9.1 


113047 T25867 


Hs.7549 


ESTs 


5.4 


113075 T34660 


Hs.6986 


ESTs; Weakly similar to 111! ALU SUBFA 


5.7 


113117 T47819 


Hs.159153 


ESTs 


5.8 


113206 T58044 


Hs.241471 


ESTs; Moderately similar to HI! ALU SU 


6.4 


113248 T63857 




yc16e1.s1 Stratagene lung (#93721) Homo 


4.6 


113260 T64896 


Hs.287420 


ESTs 


6.9 


113277 T65797 


Hs.11774 


protein (peptidyl-prolyl ds/trans isomeras 


5.6 


113278 T65B02 


Hs.11135 


yc11h10.s1 Stratagene lung (#937210) Ho 


4.5 


113440 T86121 


l 1 A t\ A A A r* 

Hs.191445 


ESTs 


6.4 


113523 T90037 


Hs.95549 


ESTs 


6.4 


113604 T92735 


Hs.296083 


ESTs 


8.7 


113702 T97307 




ESTs; Moderately similar to 111! ALU SU 


9.5 


113783 W19222 


Hs.7041 


ESTs; Weakly similar to (ill ALU SUBFA 


5.2 


113794 W37382 


Hs.11090 


ESTs 


11.9 


113808 W44735 


Hs.9286 


ESTs 


16.7 


113811 W44928 


Hs.6994 


ESTs 


4 


113822 W47350 


Hs.17466 


reflnolc acid receptor responder (tazaroten 


4.8 


113823 W47388 


Hs.55099 


rab6 GTPase activating protein (GAP and 


4 


113836 W56792 


Hs.12040 


ESTs; Weakly similar to KIAA0881 prate 


4.1 


113857 W65477 


Hs.5297 


Homo sapiens mRNA; cDNA DKFZp564 


4.3 


113886 W72471 


Hs.23920 


ESTs 


4.6 


113895 W73738 


Hs.12921 


ESTs 


7.1 


113923 W80763 


Hs.3849 


ESTs; Weakly similar to FK506-binding p 


6.8 


113931 W81205 


Hs.3496 


ESTs 


6.1 


113950 W85765 


Hs.30504 


Homo sapiens mRNA; cDNA DKFZp434 


14 


113970 W86748 


Hs.8109 


ESTs 


15 


114051 W94942 


Hs.177534 


dual specificity phosphatase 10 


5.4 
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114057 


W96222 


Hs.34192 


ESTs 


4.8 


114086 


Z38266 


Hs.288649 


Homo sapiens PAC done DJ0777O23 fro 


5.1 


114098 


Z38347 


Hs.1 18338 


ESTs; Weakly similar to similar to S. cere 


6.2 


114109 


238435 


Hs.184108 


ribosomai protein L21 


4.6 


114124 


Z38595 


Hs.125019 


ESTs; Highly similar to KIAA0886 prate 


22 


114138 


Z38763 


Hs.15740 


amyloid beta (A4) precursor protein-bindi 


8.8 


114149 


Z38814 


Hs.27196 


ESTs 


4 


114162 


Z38909 


Hs.22265 


ESTs 


7.2 


114177 


Z39062 


Hs^3740 


ESTs 


5.3 


114196 


Z39211 


Hs.150926 


fucose-l-phosphate guanyryltransferase 


4.4 


114208 


Z39301 


Hs.7859 


ESTs 


5.1 


114250 


239897 


Hs.13297 


ESTs 


7.2 


114251 


Z39898 


Hs.21948 


ESTs 


14.7 


114292 


240715 


Hs.1B4641 


deita-6 fatty acid desaturase 


19.4 


114297 


Z40758 


Hs.173091 


DKFZP434K151 protein 


8.9 


114334 


Z41342 


Hs.22941 


ESTs 


13.7 


114460 


AA024604 


Hs.26102 


ESTs 


10.1 


114471 


AA028074 


Hs.104613 


ESTs 


5.7 


114480 


AA032243 


Hs.151678 


UOP-N-acetyf-alpha-C>gaIactosamine:po 


7.3 


114518 


AA046407 


Hs.106469 


suppressor of varl (S.cerevisiae) Wke 1 


4.3 


114542 


AA055768 


Hs.293380 


ESTs 


11.7 


114549 


AA056484 


Hs.292833 


ESTs 


7.3 


114652 


AA101416 


Hs.107149 


ESTs; Weakly similar to PT&ASSOCIAT 


6.1 


114673 


AA1 13303 


Hs.95583 


transmembrane 4 superfamUy member (to 


4.3 


114698 


AA126951 


Hs.1 10857 


ESTs; Highly similar to putative DNA-dir 


7.1 


114767 


AA148885 


Hs.154443 


mini chromosome maintenance deficient (S 


5.3 


114799 


AA159323 


Hs.109929 


ESTs 


4.2 


114804 


AA160363 


Hs.269956 


ESTs 


4.8 


114811 


AA161161 


Hs.95907 


multiple inositol polyphosphate phosphata 


7.1 


114821 


AA165313 


Hs.55468 


ESTs 


4.4 


114852 


AA235035 


Hs.38260 


ESTs; Moderately similar to uMquKin spe 


5 


114901 


AA236276 


Hs.196437 


ESTs; Weakly similar to R26660J; parti 


16.9 


114902 


AA236359 


Hs.39504 


ESTs 


5.1 


114940 


AA243012 


Hs.75928 


ESTs 


6.5 


114965 


AA250737 


Hs.72472 


ESTs 


35.1 


115047 


AA252627 


Hs.82916 


homeo box B5 


5.7 


115054 


AA252863 


Hs.87729 


ESTs 


6.2 


115061 


AA253217 


Hs.41271 


ESTs 


13 


115082 


AA255557 


Hs.198269 


NADH dehydrogenase (ubiquinone) 1 alp 


28.2 


115116 


AA256486 


Hs.62275 


ESTs 


8.8 


115140 


AA258030 


Hs.279938 


ESTs; Weakly similar to supported by GE 


4.1 


115205 


AA262470 


Hs.284216 


ESTs 


8.3 


115206 


AA262491 


Hs.186572 


ESTs 


5.1 


115239 


AA278650 


Hs.73291 


ESTs; Weakly similar to similar to the bet 


4.6 


115242 


AA278755 


Hs.283732 


ESTs 


8.3 


115249 


M278961 


Hs.71124 


ESTs 


10.1 


115259 


AA279071 


Hs.13453 


splicing factor 3b; subunit 1; 155kO 


9.5 


115285 


AA279799 


Hs.293736 


ESTs 


5.8 


115291 


AA279943 


Hs.122579 


ESTs 


5.1 


115357 


AA281793 


Hs.72988 


ESTs 


5 


115377 


AA282247 


Hs.193063 


ESTs 


6.1 


115400 


AA283198 


Hs.89113 


ESTs 


4.9 


115439 


AA284561 


Hs.193090 


ESTs 


5.8 


115471 


AA287133 


Hs.59346 


ESTs; Weakly similar to ASPARTYL-TR 


11.7 


115506 


AA292537 


Hs.45207 


Human DMA sequence from clone 620E1 


6.8 


115522 


AA331393 


Hs.47378 


ESTs 


5.8 


115572 


AA398392 


Hs.59594 


ESTs; WeaWy similar to F33G12.3 gene p 


97 


115587 


AA399264 


Hs.283037 


ESTs; Highly similar to HSPC039 protein 


8.7 


115600 


AA400247 


Hs.42173 


ESTs 


4 


115612 


AA400948 


Hs.71243 


ESTs; Weakly similar to zinc finger prote 


8.4 


115646 


AA4Q4352 


Hs.305971 


ESTs 


5.3 


115652 


AA405098 


Hs.38178 


ESTs 


16.1 


115657 


AA405620 


Hs.55158 


ESTs; Weakly similar to weak similarity t 


4.7 


115658 


AA405625 


Hs.183056 


Human DMA sequence from clone 34321 


5.1 


115675 


AA406546 


Hs.82065 


Homo sapiens mRNA; cONA DKFZp564 


20.5 


115721 


AA417102 


Hs.90960 


ESTs 


4.8 


115763 


AA421560 




ESTs 


7 
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115764 AA421562 Hs.91011 anterior gradient 2 (Xenepus laevis) homo 

115835 AA428576 Hs.41371 ESTs 

115844 AA430124 Hs.7773 ESTs 

115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity 

115888 AA435839 Hs.76591 KIAA0887 protein 

115922 AA441911 Hs.71869 ESTs; Weakly similar to K1AA0926 prote 

115941 AA443602 Hs.46679 ESTs 

115947 AA443793 Hs.94761 ESTs 

115948 AA443798 Hs.43445 poly(A)-spedfic ribonuclease (deadenylat 
115951 AA443918 Hs.301048 cofilin 1 (non-muscle) 

115967 AA445887 Hs.42911 ESTs 

115984 AA447687 Hs.91109 ESTs 

116009 AA449448 Hs.44238 ESTs 

116024 AA451748 Hs.83883 Human DNA sequence from clone 718 J7 

116028 AA452112 Hs.42644 thioredoxin-fike 

116050 AA453656 Hs.88417 ESTs 

116097 AA456099 Hs.176376 ESTs 

116108 AA457566 Hs.28777 ESTs 

116121 AA459254 Hs.48855 ESTs 

116127 AA459703 Hs.279884 v-myc avian myelocytomatosls viral onco 

116129 AA459956 Hs.49163 ESTs; Highly similar to putative ribonude 

116142 AA460649 Hs.39457 ESTs 

116204 AA465701 Hs.108646 ESTs 

116221 AA478397 Hs.50180 ESTs 

116222 AA478415 Hs.89986 ESTs 

116238 AA479362 Hs.47144 DKFZP586N0819 protein 

116246 AA479961 Hs.25Q646 ESTs; Highly similar to ubiquitin-conjuga 

116249 AA480886 Hs,86693 ESTs 

116250 AA480975 Hs.44829 ESTs 

116254 AA481146 Hs.41086 ESTs; Weakly similar to OXYSTEROl-B 

116256 AA481256 Hs.88201 ESTs; Weakly similar to lysophospholipa 

116264 AA482594 Hs.272239 Homo sapiens mRNA; cDNA DKFZp586 

116265 AA482595 Hs.55189 ESTs; Weakly similar to F25B5.3 [C.eleg 
116282 AA486550 Hs.204501 ESTs; Weakly similar to Wiskott-Aldrich 
116298 AA489046 Hs.94109 ESTs 

116300 AA489194 Hs.159471 ESTs; Weakly similar to snRNP protein B 

116327 AA490959 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564 

116334 AA491457 Hs.48948 ESTs 

116337 AA496127 Hs.44070 ESTs 

116351 AA5041 16 Hs.82501 Homo sapiens mRNA; cDNA OKFZp434 

116357 AA504806 Hs.90797 Homo sapiens clone 23620 mRNA sequen 

116415 AA609204 Hs.27973 WAA0874 protein 

116443 AA620313 Hs.190488 ESTs; Weakly similar to KERATIN; TYP 

116470 C13992 Hs.83484 ESTs 

116480 C14088 glyceraldehyde-3-phosphatedehydrogena 

116578 D51272 Hs.75337 nucleolar phosphoprotein p1 30 

116579 051276 Hs.81915 leukemia-associated phosphoprotein p1 8 
116626 F02028 Hs.81907 ESTs 

116647 FQ3069 Hs.15395 ESTs; Weakly similar to ARGINYL-TRN 

116674 F04816 Hs.92127 ESTs 

116680 F08813 Hs.273829 LINE retrotransposabte element 1 

116700 F09983 Hs.317589 ESTs 

116724 F13665 Hs.65641 ESTs 

116726 F13681 Hs.53913 ESTs 

116732 F13779 Hs.165909 ESTs 

116734 F13789 Hs.93796 DKFZP586D2223 protein 

116760 H11054 Hs.155342 protein kinase C; delta 

116780 H22566 Hs.30098 ESTs 

116786 H25836 Hs.301527 tumor necrosis factor (ligand) superfamlly 

116787 H28581 Hs.15641 ESTs 

116790 H29532 Hs.101174 microtubule-assoclated protein tau 

1 16803 H47357 Hs.109701 ESTs; Moderately similar to weak similar 

116877 H68116 Hs.168732 ESTs 

116921 H72948 Hs.821 blglycan 

117216 N20083 Hs.42792 ESTs 

117232 N20579 Hs.61153 ESTs 

117284 N22162 Hs.183779 ESTs; Weakly similar to cDNA EST yk33 
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117344 N2404B 
117367 N24954 
117392 N26175 
117394 N26257 
117412 N26722 
117493 N31726 
117557 N33920 
117634 N36421 
117639 N35923 
117754 N47469 
117852 N49408 
117879 N50050 
117924 N51056 
117950 N51394 
117992 N520Q0 
118138 N57773 
118215 N62195 
118229 N62339 
118285 N62827 
118336 N63604 
118363 N64168 
118429 N66158 
118470 N66769 
118472 N66818 
118475 N66845 
118493 N67149 
118528 N67689 
118542 N68010 
118600 N69222 
118695 N7178J 
118698 N72113 
118901 N90719 
118952 N929S6 
118976 N93629 
118986 N94362 
118989 N94439 
119027 N99256 
119042 R05316 
119075 R36451 
119260 T15916 
119271 T16387 
119298 T23820 
119302 T25725 
119341 T62571 
119495 W35390 
119580 W42451 
119602 W46286 
119620 W47620 
119676 W60473 
119717 W69134 
119729 W69747 
119805 W73788 
119859 W80702 
119867 W80852 
119873 W81129 
119899 W84767 
119940 W86779 
119943 W86835 
119970 W87812 
120131 Z38656 
120150 Z39549 
120206 Z40805 
120241 Z41815 
120255 AA169752 
120314 AA194166 
120325 AA195651 
120352 AA211400 



Hs.210706 


ESTs 
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ESTs 
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ESTs; Highly similar to myelin gene expr 


5.8 


Hs.44532 


dinhintiitin 




Hs.13323 


ESTs* Wealdv similar to SOOfl JM- AMD 
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Hs.44833 


ESTs 


0 


Hs.59757 


ESTs 
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K1AA0853 protein 
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EST 


HX 


Hs.47166 


ESTs 


7 O 


Hs 48938 


ESTs 


c 
0 


Hs.74649 


ESTs 


4 1 


Hs.291033 


ESTs 


5.4 


Hs.42179 


ESTs 


10.8 




ESTs; Weakly similar to Jill ALU CLASS 


4.5 


Hs.50115 


ESTs 


O.J 


Hs.49397 


ESTs 


10.4 


Hs.49427 


ESTs 


79 




ESTs 


9.2 


Hs 50081 
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ESTs 
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ESTs 
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Hs.93391 


ESTs 


0 


Hs.125830 


ESTs 


7.3 


Hs.45105 


ESTs 


8.2 


Hs.1 14611 


ESTs 


c 

0 


Hs.5472 


ESTs 


4 


Hs.287820 


fibronectin 1 


c 

0 


Hs.102950 


ESTs; Highly similar to coat protein gamm 


4.1 


Hs.65328 


ESTs 


12.1 


Hs 15547A 


n/rtin TO 


0.0 




ESTs 


14.J 


Hs 146388 






Hs.55533 


ESTs 


0.0 




high-mobility group protein 2-dke 1 


5.6 


Hs IW&iA 
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6.5 


Hs.56009 


2'-5 , oligoadeny1ate synthetase 3 


8.1 


Hs.57787 


ESTs 


5.5 


Hs.57987 


ESTs 


4.6 


Hs.94806 


K1AA1062 protein 


4 


Hs.43213 


ESTs 


4 


Hs.58461 


ESTs 


4.8 


Hs.250696 


KDEL (Lys-Asj)-Glu-Leu} endoplasmic re 


4.2 


Hs.44865 


Homo sapiens mRNA; cDNA DKFZp586 


4.8 


Hs.58698 


ESTs 


5.9 


Hs.272531 


DKFZP586B0319 protein 


9 


Hs.14158 


copina ill 


4.8 


Hs.93581 


Homo sapiens mRNA; cDNA DKFZp586 


4 


Hs.75887 


coatomer protein complex; subunitatpha 


4.2 


Hs.153746 


ESTs 


11 


Hs.91668 


ESTs 


8.2 


Hs.65946 


ESTs 


15.6 


Hs.5672 


ESTs; WeaJdy similar to Similarity to Yea 


4.2 


Hs.221040 


KIM1038 protein 


6.8 


Hs.104106 


ESTs 


15.2 


Hs.193172 


ESTs 


6.8 
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120428 AA23B822 Hs.173694 


KIAA1097 protein 


5.6 


120524 AA261852 Hs.192905 


ESTs 


5.6 


120528 AA262107 Hs.104413 


ESTs 


4.5 


120571 AA280738 Hs.34892 


ESTs 


4.9 


120649 AA287115 Ks,192843 


ESTs 


4.5 


120655 AA287347 Hs.238205 


ESTs 


6.7 


120666 AA287833 Hs.292913 


ESTs 


8.3 


120712 AA292654 Hs.102506 


eukaryotic translation initiation (actor 2 al 


4.6 


120713 AA292655 H&96557 


ESTs 


10.6 


120724 AA293470 Hs.100747 


ESTs 


5,4 


120873 AA358015 


EST 


7.1 


120885 AA365515 Hs.301872 


ESTs; Moderator/ similar to HI! ALU SU 


4.6 


120919 AA381125 Hs.301444 


ESTs 


8.2 


120948 AA397822 Hs.104650 


ESTs; Highly similar to similar to mago n 


8.6 


120969 AA398116 Hs.129206 


casein kinase 1; gamma 3 


10.5 


120977 AA398155 Hs.97600 


ESTs 


10.9 


121103 AA398936 Hs.97697 


EST 


7.4 


121291 AA401753 Hs.8186 


lung cancer candidate 


5.3 


121320 AA403008 Hs.301927 


T-cell receptor; alpha (V;D;J;C) 


13.5 


121463 AA411745 Hs.239681 


ESTs; Weakly similar to K1AA0554 prote 


8.9 


121596 AA416740 Hs.174104 


ESTs 


22.6 


121723 AA419622 Hs.104800 


ESTs; Weakly similar to Mouse 19.5 mRN 


8 


121748 AA421171 Hs.234545 


ESTs 


5.6 


122125 AA434411 Hs.98806 


ESTs 


5.3 


122522 AA449444 Hs.98969 


ESTs 


4 


122655 AA454756 Hs.97837 


ESTs 


4 


122704 AA456326 Hs,99445 


ESTs 


6.2 


122782 AA459894 Hs.99472 


ESTs 


5.3 


122856 AA463740 Hs.75367 


Src-fike-adapter 


13.1 


122882 AA465381 H3.108812 


ESTs; Weakly similar to B0041. 5 [Oeteg 


5.5 


122928 AA476578 Hs.101840 


ESTs 


6.3 


122974 AA478625 Hs.194215 


ESTs 


6 


122997 AA479295 Hs.106290 


Kelch motif containing protein 


12.5 


123016 AA480103 Hs.323231 


ESTs; Weakly similar to alternatively spli 


4.4 


123107 AA486071 Hs.104207 


ESTs 


8.3 


123111 AA486273 Hs.191721 


ESTs 


4.2 


123114 AA486407 Hs.129928 


ESTs; Moderately similar to K1M0454 p 


5.2 


123136 AA487449 Hs.194024 


ESTs 


4.2 


123137 AA487468 Hs.100686 


ESTs; Weakly similar to secreted cement 


14.6 


123169 AA468892 


ESTs; Weakly similar to Gag-Pol polypro 


4.5 


123176 AA489O20 Hs.69233 


ESTs 


5.2 


123338 AA504249 Hs.187585 


ESTs 


4 


123436 AA598714 Hs.223014 


protease; serine; 15 


7.3 


123442 AA598803 Hs.111496 


ESTs 


5.9 


123449 AA598899 Hs.112493 


Homo sapiens mRNA; cDNA DKFZp564 


. 4.1 


123494 AA599786 Hs.112110 


ESTs 


4 


123503 AA600121 Hs.293156 


ESTs 


12.8 


123533 AA608751 


ESTs; Weakly similar to !!!! ALU SUBFA 


7.9 


123619 AA609200 


ESTs 


23.1 


123673 AA609471 Hs.158549 


ESTs 


6.6 


123729 AA609778 Hs.278672 


membrane component; chromosome 11; s 


4.7 


123819 AA620636 Hs.1 12264 


ESTs 


4 


123960 AA621765 Hs.287733 


methytmafanale-semialdehyde dehydroge 


7.6 


124000 057317 Hs.74861 


activated RNA polymerase II transcription 


4,4 


124006 D60302 Hs.270016 


ESTs 


20.6 


124012 D80240 Hs.241471 


HUM5G11A Human fetal brain (TFujiwa 


6.7 


124021 F02859 Hs.13974 


ESTs 


4.7 


124049 F10523 Hs.74519 


primase; polypeptide 2A (58kD) 


4.7 


124059 F13673 Hs.283713 


ESTs 


7.7 


124243 H66710 Hs.133525 


ESTs 


5.5 


124308 H93575 Hs.241507 


Homo sapiens mRNA; cDNA DKFZp564 


11.4 


124314 H94877 . Hs.215766 


GTP-binding protein 


13.7 


124315 H94892 Hs.288757 


v-ral simian leukemia viral oncogene horn 


14 


124350 N21359 Hs.101282 


Homo sapiens mRNA; cDNA DKFZp434 


8.6 


124352 N21626 Hs.1024Q6 


ESTs 


7.2 


124357 N22401 


yw37g07.s1 Morton Fetal Cochlea Homo 


5.2 


124390 N29325 Hs.7535 


ESTs; Highly similar to COBW-like place 


7.9 
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124438 N4Q188 Hs.11090 
124447 N48000 

124457 N50114 Hs.266175 

124539 N63172 Hs.146409 

124626 N74604 Hs.11090 

124632 N79515 Hs.306117 

124644 N91279 Hs.109654 

124676 R01037 Hs.181013 

124677 R01073 

124724 R12405 Hs.1 12423 

124773 R40923 Hs.106604 
124777 R41933 

124792 R44357 Hs.48712 

124857 R63652 Hs.137190 

124911 R88992 Hs.180612 

124955 T10598 Hs.324841 

124958 T11134 Hs.431 

125038 T78089 Hs.270134 

125092 T92544 Hs.137548 

125132 W15495 Hs.129781 

125144 W37999 Hs.24336 
125154 W38419 

125243 W86423 Hs.105413 

125279 W93640 Hs.4779 

125299 239436 Hs.1Q2720 

125303 Z39821 Hs.288193 

125304 Z39833 Hs.124940 
125474 AA151216 Hs.75103 
125509 AA044232 Hs.288967 
125580 AA126504 Hs.267812 
125582 AA507383 Hs.74649 
125670 AI432621 Hs.82685 
125698 AA748483 Hs.191356 
125745 AI283493 Hs.75722 
125852 H09290 Hs.76550 
125972 AA434562 Hs.35406 
126160 N90950 Hs.265398 
126257 N99638 Hs.124084 
126337 AI066486 Hs.40500 
126405 U46278 Hs.122489 
126537 W40262 Hs.146310 
126590 W78968 Hs.181307 
126712 AA205862 Hs7942 
126721 T72569 Hs.125359 
126764 AI334393 Hs.102178 
126804 A1203334 Ks.160628 
126819 AA305536 Hs.279607 
126877 A1052047 Hs.26102 
126991 R31652 Hs.821 
127479 AA513722 Hs.179729 
127514 M826926 Hs.204214 
127663 W07286 Hs.10340 
127677 AA916752 Hs.264190 
127814 AA761755 Hs.136713 
127997 AI281549 Hs.311054 
128092 AA904617 Hs.166229 
128218 H02682 Hs.292154 
I28466 059653 Hs.241471 
I28482 U83908 Hs.296251 
128517 AA280617 Hs.100861 

1 28530 AA504343 Hs.183475 

I28559 AA226801 Hs.101448 

I28574 AA412048 Hs.38260 

I28595 U31875 Hs.152677 

128610 L38608 Hs.10247 

I28629 AA399187 Hs.102708 

I28649 AA142853 Hs.1Q3106 



ESTs 

Homo sapiens mRNA; cDNA DKFZp586 
ESTs 

cell division cycle 42 (GTP-binding prote 
ESTs 

interteukin 13 receptor; alpha 1 

ESTs; Moderately similar to outer membr 

phosphoglycerate mutase 1 (brain) 

ESTs; Weakly similar to !!!! ALU CLASS 

Homo sapiens mRNA; cDNA DKF2p586 

ESTs 

ESTs 

ESTs; Weakly similar to cDNA EST EMB 

ESTs 

ESTs 

ESTs; Weakly similar to 111! ALU SUBFA 
murine leukemia viral (bmM) oncogene h 
ESTs 

C084 antigen (leukocyte antigen) 

chromosome 21 open reading frame 5 

ESTs 

ESTs 

ESTs 

ESTs; Moderately similar to similar to AD 

ESTs 

ESTs 

GTP-binding protein 

tyrosine 3-rnonooxygenase/tryptophan 5-m 
ESTs 

sorting nexin 4 

cytochrome c oxidase subunit Vic 
CD47 antigen (Rh-retated antigen; integri 
general transcription factor IIH; polypepti 
ribophorin II 

Homo sapiens mRNA; cDNA DKFZp564 
ESTs 

ESTs; Weakly similar to transformation-r 
tumor necrosis factor receptor superfamily 
similar to S. cerevisiae RER1 
ESTs 

ESTs; Weakly similar to putative p150 {H 

H3histone; family 3A 

ESTs 

Thy-1 ceil surface antigen 

ESTs 

ESTs 

ESTs 

ESTs 

bigfycan 

cofiagen; type X; alpha 1 (Schmid metaph 
ESTs 

ESTs; Weakly similar to weak similarity t 
ESTs; Highly similar to MEM3 [M.muscu 
ESTs; Weakly similar to V4-1 (H^apiens 
ESTs 
ESTs 

ESTs; Moderately similar to recombinatio 
EST 

programmed cell death 4 

ESTs; Weakly simitar to p60 katanm (H.s 

Homo sapiens done 25061 mRNA sequen 

metastasis associated 1 

keratin 8 

short-chain alcohol dehydrogenase family 
activated leucocyte cell adhesion molecule 
DKFZP434A043 protein 
Homo sapiens mRNA for G7b protein (G 
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128651 
128653 
128656 
128717 
126727 
128764 
128781 
128793 
128835 
128845 
128871 
128922 
128925 
128938 
128946 
128948 
128955 
129005 
129009 
129017 
129057 
129075 
129095 
129124 
129160 
129164 
129180 
129224 
129229 
129240 
129241 
129243 
129247 
129259 
129270 
129281 
129300 
129318 
129319 
129351 
129366 
129383 
129388 
129391 
129404 
129406 
129426 
129453 
129513 
129519 
129606 
129622 
129626 
129627 
129628 
129642 
129663 
129665 
129691 
129783 
129800 
129836 
129850 
129869 
129896 
129982 
129985 



AA446990 

R48943 

AA458542 

T30617 

M64174 

N49308 

X85372 

W93562 

W15528 

AA455658 

AA400271 

AA252023 

D61676 

AA410325 

N29353 

AA485655 

F10290 

AA460049 

AA1 31421 

H13108 

X62466 

AA129465 

L12350 

AA234530 

AA131252 

AA282183 

R40556 

X89109 

AA211941 

W24360 

AA435665 

H88033 

AA151574 

AA090695 

Z35227 

AA026318 

C20976 

N93155 

AA037467 

AA167268 

H18027 

W92984 

AA151621 

T80814 

AA172056 

N23707 

AA412087 

AA421213 

C00225 

AA298786 

R21443 

AA278243 

AA447410 

AA258308 

U26727 

R5OO08 

AA442768 

M88458 

X06700 

AA454618 

AA252436 

AA452161 

N20593 

AA102520 

AA043021 

M87789 

AA450045 



Hs.103135 

Hs.10315 

Hs.10326 

Hs.104222 

Hs.50651 

Hs.104938 

Hs.105465 

Hs.105749 

Hs.106390 

Hs.10649 

Hs.106778 

Hs.9589 

Hs.21851 

Hs.107260 

Hs.107318 

Hs.223025 

Hs.185807 

Hs.13323 

Hs.75607 

Hs.107968 

Hs.276770 

Hs.83765 

Hs.108623 

Hs.108802 

Hs.109007 

Hs.109045 

Hs.318401 

Hs.109606 

Hs.109643 

Hs.237868 

Hs,1Q9706 

Hs.109727 

Hs.109733 

Hs.181385 

Hs.109918 

Hs.289101 

Hs.1 10165 

Hs.285976 

Hs.30340 

Hs.62349 

Hs.184697 

Hs.288224 

Hs.110964 

Hs.11101 

Hs.317584 

Hs.1 11 138 

Hs.111323 

Hs.1 11632 

Hs.306163 

Hs.1 12242 

Hs.166254 

Hs.323949 

Hs.111334 

Hs.71968 

Hs.1 174 

Hs.11806 

Hs.1 1866 

Hs.1 18778 

Hs.119571 

Hs.12479 

Hs.12540 

Hs.206521 
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130693 M487202 
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Hs.23703 
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Hs.155017 
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Hs.241392 
Hs.155485 
Hs.178710 
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Hs.252587 
Hs.15984 
Hs.16085 
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Hs.1657 
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Hs.1695 
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Hs.18879 
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Hs.20830 
Hs.20991 
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ribosoma! protein 126 
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KIAA0704 protein 

ESTs 
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tubulin; gamma 1 

calrium/calmodurin-dependent serine pro 
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activated p21cdc42Hs kinase 

ubiquitin-acfivating enzyme E1C (homoto 

K1AA0212 gene product 
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nuclear receptor interacting protein 1 
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TATA box binding protein {TBP)-associa 

cyclin-dependent kinase 5 

K1AA0005 gene product 

Homo sapiens mRNA for putative glucosy 

ESTs 

small inducible cytokine A5 (RANTES) 
huntingOnnnteracting protein 2 
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protein kinase; DNA-activated; catalytic p 
N-acetyltransferase 1 (arytamine N-acetyl 
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cartilage oRgomeric matrix protein (pseud 
pituitary tumor-transforming 1 
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ESTs; Highly similar to CGM3 protein [H 
ESTs; Moderately similar to similar to C.e 
ESTs 

estrogen receptor 1 

ESTs 

ESTs 

ESTs; Weakly similar to cytochrome P45 
matrix metaltoproteinase 2 (gelatinase A; 
matrix metafloproteinase 12 (macrophage 
ATPase; H+ transporting; r/sosomal (vacu 
interferon-stimubted transcription factor 
ESTs 
ESTs 

Rho-associated; coHed-coa containing pro 

ESTs 
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adenylate cyclase 7 
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occludin 

POP7 (processing of precursor; S. cerevis 
ESTs 

Homo sapiens done 23965 mRNA sequen 
ESTs 

ESTs; Weakly similar to katanin p80 subu 
putative DNA/chromatin binding motif 
ceroid-lipofuscinosis; neuronal 2; late Infa 
ubiquitin-activating enzyme E1 (A1S9T a 
kinesin-Iike 2 
SET domain; bifurcated; 1 
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6.1 
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8.3 
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13.3 

10.1 
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4.7 


133239 AA059405 


Hs.179882 


Homo sapiens clone 24655 mRNA sequen 


5.5 


133240 D31161 


Hs.242894 


ESTs 


9 


133257 AF006086 


Hs.6895 


actin related protein 2/3 complex; subunit 
ESTs; Weakly similar to cDNA EST yk37 


7.7 


133264 W72187 


Hs.69192 


6.7 


133274 AA488886 


Hs.6949 


ESTs 


4.2 


133281 M421079 


Hs.69594 


ESTs; Weakly similar to Sox-like transcri 


4.9 


133283 AA410507 


Hs.6968 


ESTs 


4.3 


133287 L15702 


Hs.69771 


B-factor, properdin 


9.3 


133294 R79723 


Hs.69997 


zinc finger protein 238 


30.4 


133297 AA600057 


Hs.70266 


KIAA0905 protein 


10.4 


133318 AA256168 


Hs.152316 


ESTs 


8.5 


133362 H06195 


Hs.7194 


ESTs; Highly similar to CGI-59 protein [H 


14 


133370 AA156897 


Hs.72157 


DKF2P564I1922 protein 


5 


133391 X57579 


Hs.727 


Inhibbi; beta A (actlvln A; activln AB alp 


13.9 


133395 AA491296 


Hs.72805 


ESTs 


4.3 


133422 N79516 


Hs.73287 


ESTs; Weakly similar to eyelid lD.melano 


4.5 


133431 AA255438 


Hs.7358 


Homo sapiens mRNA; cDNA DKFZp566 


8 


133435 T23983 


Hs.323966 


ESTs 


5 


133449 AA094989 


Hs.7381 


voltage-dependent anion channel 3 


8.7 


133468 X03068 


Hs.73931 


major histocompatibility complex; class II 


5 


133484 X78710 


Hs.211581 


metakegulatory transcription factor 1 


5.3 


133506 AA316868 


Hs.74346 


ESTs; Weakly similar to 140G11.h [D.me 


6.8 


133517 X52947 


Hs.74471 


gap junction protein; alpha 1; 43kO (conn 


5.7 


133551 D63480 


Hs.278634 


K1AA0146 protein 


4.8 


133569 AA313977 


Hs.172772 


transcription elongation factor B (Sill); po 


9.5 


133572 W94333 


Hs.279915 


transbcase of inner mitochondrial membr 


5 


133577 F03717 


Hs.75063 


human immunodeficiency virus type 1 enh 


7.4 


133589 L37368 


Hs.75104 


RNA-bindlng protein S1; serine-rich dom 
glyoxalase 1 


5 


133603 D13315 


Hs.75207 


4.2 


133617 AA148318 


HS75249 


K1AA0069 protein 


4.5 


133627 U09587 


Hs.75280 


glycyl-tRNA synthetase 


10 


133633 D21262 


Hs.75337 


nucleolar phosphoprotein p130 


4.5 


133634 U24166 


Hs.234279 


microtubule-associated protein; RP/EB fa 


15.2 


133640 D83004 


Hs.75355 


ubiquitin-conjugating enzyme E2N (homo 


9.1 


133644 089077 


Hs.75367 


Srciike-adapter 


6.4 


133649 AA479139 


Hs.75393 


add phosphatase 1; soluble 


4.8 


133652 AA287383 


Hs.7540 


ESTs 


4.2 


133674 AA458946 


Hs.75497 


ESTs 


4.3 


133700 K01396 


Hs.297681 


protease inhibitor 1 (antl-elastase); alpha- 


8.3 


133705 N21648 


Hs.75659 


MpV17 transgene; murine homolog; glom 


4.6 


133716 Y00282 


Hs.75722 


ribophortn II 


7.5 


133720 L27841 


Hs.75737 


pericentrlotar material 1 


9.4 


133752 U49278 


Hs.75875 


ubiquitin-conjugating enzyme E2 variant 


4.5 


133765 D21255 


Hs.75929 


cadherin 11 (OB-cadherin; osteoblast) 


6.4 
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133772 W73693 
133774 Z23090 
133776 J03473 
133784 AA214305 
133814 M33882 
133829 AA453783 
133834 AA147510 
133839 M59815 
133842 U73477 
133845 T68510 
133859 U86782 

133867 043948 

133868 U58090 
133871 AA454597 
133893 X01060 
133914 N32811 
133918 W72783 
133944 AA045870 
133946 AA156565 
133963 L34587 
133980 D00760 
133990 C02374 
133999 M28213 
134030 J03077 
134032 Z81326 

134045 S82470 

134046 028473 
134064 087685 
134070 H98621 
134087 U51166 
134090 M22382 
134098 X06323 
134110 U41060 
134132 U32519 
134168 AA398908 
134170 M63138 
134208 U88871 
134258 L28010 
134288 AA430008 
134310 AA313414 
134326 U16306 
134329 D38551 
134331 AA452020 
134351 R82074 
134357 L43575 
134363 M37033 
134367 X54199 

134374 062633 

134375 M412720 

134376 X02874 
134381 U56637 
134388 -M15841 
134395 L09717 
134399 H99801 
134401 AA243746 
134405 J04177 
134415 AA329274 
134417 087969 
134419 108044 
134421 AA122386 
134423 W96151 
134438 AA449984 
134446 T25732 
134453 X70683 
134470 X54942 
134487 R38185 
134495 063477 



Hs.76038 
Hs.76067 
Hs.1 77766 
Hs.301064 
Hs.76391 
Hs.76550 
H3.288660 
Hs.170250 
Hs.285013 
Hs.76704 
Hs.178761 
Hs.76989 
Hs.183874 
Hs.182793 
Hs.77356 
Hs.77542 
Hs.58382 
Hs.7780 
Hs.173878 
Hs.184693 
Hs.250811 
Hs.7822 
Hs.78305 
Hs.78575 
Hs.78589 
Hs.78768 
Hs.172801 
Hs.78893 
Hs.78946 
Hs.173824 
Hs.79037 
Hs.79086 
Hs79136 
Hs.220689 
Hs.181634 
Hs.79572 
Hs.79993 
Hs.808 
Hs.8117 
Hs.8148 
Hs.81800 
Hs.81848 " 
Hs.111222 
Hs.82109 
Hs.82171 
Hs.82212 
Hs.82285 
Hs.8236 
Hs.82389 
Hs.82396 
Hs.184270 
Hs.82575 
Hs.8262 
Hs.82689 
Hs.211577 
Hs.82772 
Hs.82911 
Hs.82921 
Hs.82961 
Hs.82985 
Hs.83006 
Hs.246857 
Hs.83419 
Hs.83484 
Hs.83758 
Hs.83954 
Hs.84087 



isopentenyWiphosphate delta isomerase 


7.9 


heat shock 27kD protein 1 


4.1 


ADP-ribosyttransferase (NAD+; poly (AD 


13 


ESTs 


5.2 


myxovirus (influenza) resistance 1; homol 


11.7 


Homo sapiens mRNA; cONA OKFZp564 


9,4 


serine protease; umbilical endothelium 


4.8 


complement component 4A 


6.7 


putative human HLA dass IJ associated p 


7.1 


ESTs 


6.3 


26S proteasome-assoclated pad1 homotog 


13.7 


K1AA0097 gene product 


4.1 


cullin4A 


4 


ESTs 


4.7 


transferrin receptor (p90;CD7l) 


8.3 


ESTs 


5 


ESTs; Weakly similar to C13F10.5 [Cete 


4.5 


Homo sapiens mRNA; cONA DKFZp564 


6.3 


4-n(trophenylphosphatase domain and non 


6.4 


transcription elongation factor 8 (Sill); po 


6.3 


proteasome (prosome; macropaln) subunit 


11.9 


Homo sapiens mRNA; cONA DKFZp564 


8.2 


RAB2; member RAS oncogene family 


5.2 


prosaposin (variant Gaucher disease and v 


4.6 


protease inhibitor 12 (neuroserpin) 


6.5 


BB1 


11.9 


isaleucine-tRNA synthetase 


52 


K1AA0244 protein 


7.3 


cuilin 3 


4.7 


thymine-ONA glycosylate 


7 


heat shock 60kD protein 1 (chaperonin) 


4.5 


ribosomal protein; mitochondrial; L3 


9.4 


UV-1 protein; estrogen regulated 


4.4 


Ras-GTPase-aciivaling protein SH3-doma 


6.6 


Human Chromosome 16 BAC done CIT9 


8.6 


cathepstn D (lysosomal aspartyl protease) 


9.3 


peroxisomal biogenesis factor 7 


6.3 


heterogeneous nudear ribonudeoprotein F 


4.3 


ESTs 


6.9 


Homo sapiens clone 24856 mRNA sequen 


7.4 " 


chondroitin sulfate proteoglycan 2 (verslc 


6.1 


RAD21 (S. pombe) homolog 


8.6 


ESTs; Weakly similar to CGM28 protein 


6.1 


syndecan 1 


4.4 


Human done 191B7 placenta expressed m 


6.6 


CD53 antigen 


5.3 


phosphoribosylglycfoamide formyltransfe 


4.8 


ESTs 


15.2 


ESTs; Highly similar to CGI-118 protein 


7.2 


2 , ;5'-oligoadenylate synthetase 1 


6.4 


capping protein (actin filament) muscle Z- 


4 


small nudear ribonudeoprotein potypepti 


5.7 


lysosomal-associated membrane protein 2 


6.9 


tumor rejection antigen (gp96) 1 


4.5 


kinedin 1 (klnesin receptor) 


112 


collagen; type XI; alpha 1 


15,3 


protein tyrosine phosphatase type IVA; m 


4.1 


solute carrier family 35 (CMP-sialic acid t 


4.2 


trefoil factor 3 (intestinal) 


5.9 


collagen; type V; alpha 2 


5.8 


ESTs; Highly similar to CGI-139 protein 


4.4 


ESTs; HigWy sMaf to proteine kinase JN 


7 


WAA0252 protein 


4.6 


SRY (sex determining region Y)4}ox 4 


5.1 


CDC28 protein kinase 2 


20.3 


Homo sapiens unknown mRNA 


5 


WAA0143 protein 


16.1 
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Hs.284226 

Hs.89603 

Hs.89718 

Hs.89751 

Hs.89981 

Hs.90061 

Hs.90280 

Hs.904 

Hs.90419 

Hs.9071 

Hs.92308 



134498 M63180 Hs.84131 
134506 U45328 Hs.84285 
134529 H24460 Hs.848 
134570 U66615 Hs.172280 
5 134582 AA234966 Hs.86041 
134600 R68884 Hs.86347 
134623 X74496 Hs.86978 

134654 W23625 Hs.8739 

134655 AA454070 Hs.123090 
10 134675 AA250745 Hs.87773 

134711 X04011 Hs.88974 
134714 U89922 Hs.890 
134722 W47183 
134776 J05582 

15 134806 Z49099 
134810 M27394 
134840 U51477 
134843 H60595 
134853 D82348 

20 134866 U84011 
134868 Z39762 
134885 N27670 

1349S2 N46086 

134989 AA236324 Hs.92381 

25 134992 H05625 Hs.5831 
134993 AA282343 Hs.301005 
135010 059675 Hs.92927 
135015 U54999 Hs.278338 
135029 AA224180 

30 135032 AA243497 Hs.173685 
135037 U77948 Hs.278589 
135059 AA598449 Hs.93832 
135071 L08069 Hs.94 
135083 AA495950 Hs.94262 

35 135117 W52493 Hs.94694 
135144 AA044842 Hs.95260 
135154 AA126433 Hs.267812 
135218 D31157 Hs.324277 
135237 AA454930 Hs.9691 

40 135243 AA215333 Hs.97101 
135335 H20989 Hs. 198281 
135349 083174 Hs.9930 
135367 AA4B0109 Hs.9963 
135389 U05237 Hs.99872 

45 135400 M23263 
135411 L10333 
300019 M97935 

300021 M97935 

300022 M97935 
50 300089 AI199738 

300107 AI694585 
300254 AW079607 Hs.188417 
300328 AW015860 Hs.224623 
300549 AA699328 Hs.298119 

55 300711 AI492179 Hs.166244 
300921 AW293224 Hs.232165 
301124 T79326 Hs.298262 
301165 N85789 Hs.150186 
301576 AI682905 Hs.270431 

60 301604 AA373124 Hs.24809 
301704 AA526313 Hs.293691 
301782 N99399 Hs.143046 
301884 AA312082 Hs.1Q5445 
301936 NM_004694Hs.1 14924 

65 302002 AF013956 Hs.5637 
302032 NM.001992HS.128087 
302067 H05698 Hs.222399 



Hs.99915 
Hs.99947 



Hs.208275 
Hs.270464 



uiicunyi-irwiH synineiase 


6.1 


uDiqunin-oonjugaung enzyme tzi (nomoi 


4.6 


Fk^rtfi JiinHrrvi nrMa'm A KtAfW 

rfvavo-oinaing proiein h (ay wj/ , 


6.2 


CIA/I/CMC rAtalaH' molnv *see/\**irtif\A' m*Jfi n 

ovvuoNr reiaiea, mainx associateo, acun 


4.8 


ocaO tnpiet repeat binding protein 1 


4.7 


co j s, weawy similar to predicted using G 


5.8 


prolyl endopeptidase 


4.5 


co i s, weaKiy similar to UKr tuKZUuc [ 


13.7 


CCTe 

tola 


5.8 


protein kinase) cAMP-dependent; catalytj 


8.9 


cytochrome b-245; beta polypeptide (chro 


6.8 


lymphotox'in beta (TNI- superfamity; mem 


35.7 


ESTs; Weakly similar to neural F box pro 


8.1 


mudn 1; transmembrane 


6.2 


spermine synthase 

membrane-spanning 4-domains; subfamily 


4.2 
7 


diacylgiycerol kinase; zeta (104kD) 


4.1 


progesterone binding protein 


4.7 


5-arniao!mldazo!e-4-carboxarn]de ribonuc 


10.2 


amyfo-1;6-glucostdase; 4-afpha-glucanotr 


12.1 


KiAA08o2 protein 


6 


progesterone membrane binding protein 


5 


EOT© * * 

coTS 


4.1 


nomo sapiens mruvft, cnromosome i spe 


16.8 


COlS 


4 


punne-ncn eiemeni Dinging protein o 


4.4 


FQTe 
CO IS 


7 


1 Ok I - --» ' 

LGN protein 


4.8 


ESTs; Moderately similar to 17-beta-hydr 


13.6 


Human DNA sequence from done 30M3 


4 


general transcription factor 11; i 


8 


Homo sapiens done 24483 unknown mRN 


5.4 


neat shock protein, DNAJ-UKe 2 


9.3 


FQTe 
COI S 


6.7 


Homo sapiens clone 24837 mRNA sequen 


10.2 


Homo sapiens mRNA; cDNA DKFZp586 


6.6 


ouiuiiy iicahi *i 


7.4 


cois, weawy similar to growth (actor-res 


6.2 


FCTc 


19.5 


puuiuvti \j pioicui-coupiea receptor 


8.8 


nun tun fa Hna<;«' mucrJo 
jjyiuvduc RJUdic, fnudCfc 




wiidyciruinuing protein <c icouigen c) 


5.5 


TYRO protein t/rosinQ kinase binding pro 


5.4 


mtdi Muricirnef anugen 


7.8 


driuiuyen recepior {QjnyuroiesiDSierone re 


9.1 


icuujiun i 


5.3 


AFFY ronfrnl* CTAT1 


8.3 


AFFY (V»ntml- QTAT1 
MrrA cgnUui. ol A 1 1 


7 


AFFX control: STAT1 


14 


ESTs; Weakly similar to 1111 ALU CLASS 


9.1 


ESTs; Weakly similar to l!ll ALU CLASS 


7.4 


ESTs; Weakly similar to ZnM [H^apien 


30.1 


ESTs 


11,9 


ESTs 


5.5 


ESTs; Weakly similar to cDNA EST yk40 


11 


ESTs 


11 


ESTs; Weakly similar to (IJ88J8.1 {Rsapi 


8.8 


ESTs; Weakly similar to PTERIN-4-ALP 


6 


ESTs; Weakly similar to Ml ALU SUBFA 


. 4.7 


ESTs; Weakly similar to C17G10.1 [Cele 


8 


ESTs 


4.2 


EST cluster {not in UniGene) with exon h 


18 


GDNF family receptor alpha 1 


20.7 


EST duster (not in UniGene) wiin exon h 


11.6 


chromobox homolog 4 (Drosophila Pc cla 


9.2 


EST duster (not in UniGene) with exon h 


4.3 


ESTs; Weakly similar to protein-tyrosine 


7.8 
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302145 NNL003613HS.151407 
302236 Aimm Hs.6557 
302276 NMJ>04448Hs.32391O 
302290 AL1 17607 Hs.175563 
5 302326 NMJKW271HS.184018 
302342 AB023141 Hs.190386 
302372 AL117406 Hs.20Q102 
302422 AB021227 Hs.3743 
302431 AF129530 Hs.226434 

10 302501 AF022726 Hs.251446 
302505 AL049650 Hs.247874 
302533 L36149 Hs.248116 
302638 AA463798 Hs.102696 
302656 AW293005 HsJ0704 

15 302792 AA343696 Hs.46821 
302820 X04588 Hs.85844 
302838 U66049 Hs.82171 
302892 N5B545 Hs.42346 
302977 AW263124 Hs.315111 
20 302989 N46406 Hs.84700 
303007 AA478876 Hs.317714 
303052 AF140242 Hs.279926 

303131 AW081061 Hs.103180 

303132 AI929819 Hs.4055 
25 303153 U09759 Hs.246857 

303387 AA908797 Hs.180799 
303499 AI815990 Hs.293515 
303502 AA488528 
303576 T07216 Hs.301226 

30 303620 AA397546 Hs.119151 
303634 AI953377 Hs.28444 
303642 AW299459 Hs.111977 
303654 AA436942 Hs.288529 
303733 AW502496 Hs.15220 

35 303780 AM24014 Hs.18995 
303792 C75094 Hs.199839 
303842 A1337304 Hs.126268 
303951 AW475081 Hs.172928 
304465 AA421948 

40 304507 AA456426 
304591 AA505702 
304601 AA507875 
304659 AA533185 
305040 AA630582 Hs.169476 

45 305134 AA653159 Hs.179661 
305415 AA725116 Hs.78465 
305453 AA738110 
305898 AA872838 
305913 AA876109 

50 305950 AA884479 

306004 AA889992 Hs.2186 
306009 AA894560 Hs.2B3370 
306060 AA906161 Hs.76277 
306398 AA970548 Hs.297681 

55 306505 AA987722 Hs.172928 
306576 AA995761 Hs.276092 
307117 AJ184111 Hs.76067 
307138 AI185516 Hs.172928 
307187 AI190870 Hs.276417 

60 307542 AI280859 Hs.62954 
307554 AI281603 Hs.172928 
307606 AI351739 Hs.276726 
308079 AI472733 Hs.270208 
308307 AI581398 Hs.172928 

65 308511 AI687580 Hs.169476 
308615 A1738593 Hs.101774 
308677 AI761173 



EST duster (not in UniGene) with exon h 


15.1 


zinc finger protein 161 


25.8 


EST cluster (not in UniGene) with exon h 


21.6 


Homo sapiens mRNA; cONA OKFZp564 


41.4 


EST duster (not in UniGene) with exon h 


8.9 


KIAA0924 protein 


5.4 


Homo sapiens mRNA; cONA DKF2p434 


8.9 


matrix metaiioproteihase 24 (membrane-in 


5.2 


EST cluster (not in UniGene) with exon h 


5.3 


EST duster (not in UniGene) with exon h 


9.9 


multiple UniGene matches 


4.3 


chemokine (C motif) XC receptor 1 


4,9 


ESTs; Weakly similar to C11D2.4 [C.eleg 


5.3 


ESTs 


8.4 


ESTs; Weakly sim3ar to putative [Rsapie 


4.5 


EST duster (not in UniGene) wifo exon h 


6.8 


EST duster (not in UniGene) with exon h 


8.4 


hlstone deacetytase 3 


22.6 


EST duster (not in UniGene) with exon h 


6.8 


EST duster (not in UniGene) with exon h 


8.9 


pallid (mouse) homolog; pairtdin 


10.1 


EST duster (not in UniGene) with exon h 


24.4 


actin-fikee 


6.3 


ESTs 


17.7 


mitogen-activated protein kinase 9 


11.4 


ESTs 


15.8 


ESTs 


7.2 


EST duster (not in UniGene) with exon h 


5.3 


EST duster (not in UniGene) with exon h 


16.2 


ESTs 


8.9 


ESTs; Weakly similar to predicted using G 


12 


EST duster (not in UniGene) with exon h 


4.2 


ESTs 


8.4 


ESTs; Weakly similar to zinc finger prate 


5.2 


ESTs; Moderately similar to K1AA0456 p 


28.4 


ESTs; Highly simBar to NG22 (H^apiens 


4.4 


ESTs; Weakly similar to similar to PDZ d 


8.1 


collagen; type 1; alpha 1 


7.5 


EST singleton (not in UniGene) with exon 


6.5 


EST 


5.4 


EST singleton (not in UniGene) with exon 


9.8 


EST singleton (not in UniGene) with exon 


7.5 


EST singleton (not in UniGene) with exon 


7 


glyceraldehyde-3-phosphate dehydrogena 


12.4 


EST singleton (not in UniGene) with exon 


8.7 


EST singleton (not in UniGene) with exon 


5.3 


EST singleton (not in UniGene) with exon 


4.1 


keratin 8 


7.7 


EST singleton (not in UniGene) with exon 


6.3 


EST singleton (not in UniGene) with exon 


5.6 


EST singleton (not in UniGene) with exon 


13.2 


EST singleton (not in UniGene) with exon 


4.4 


EST singleton (not in UniGene) with exon 


4.6 


EST singleton (not in UniGene) with exon 


7.6 


EST singleton (not in UniGene) with exon 


19.7 


EST singleton (not in UniGene) with exon 


5.5 


heat shock 27kD protein 1 


7.7 


collagen; type!; alpha 1 


8.8 


EST singleton (not in UniGene) with exon 


4.1 


EST singleton (not in UniGene) with exon 


6 


EST singleton (not in UniGene) with exon 


10.8 


EST singleton (not in UniGene) with exon 


4.7 


ESTs 


4.2 


collagen; type I; alpha 1 


5.4 


EST singleton (not in UniGene) with exon 


10.1 


EST singleton (not in UniGene) with exon 


15.1 


EST singleton (not in UniGene) with exon 


4.6 
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308852 A1829848 Hs.1 82937 


pepGdytprolyl isomerase A (cyclophilin A 


5.9 


308974 AI872290 Hs.300697 


Immunoglobulin gamma 3 (Gm marker) 


4.5 


308981 AI873242 


EST singleton (not in UniGene) with exon 


7.6 


308995 AI880172 


EST singleton (not In UniGene) with exon 


6.6 


309177 AI951118 


EST singleton (not in UniGene) with exon 


243 


309186 AI952723 Hs.90207 


EST singleton (not in UniGene) with exon 


6.1 


309198 AI955915 


major histocompatibility complex; class I; 


5.6 


309226 AI969897 


EST singleton (not In UniGene) with exon 


6.2 


309279 A1990102 


EST singleton (not in UniGene) with exon 


7.9 


309583 AW170035 


EST 


64.5 


309624 AW191929 Hs.252989 


EST 


5.3 


309629 AW192764 HsA72928 


col/agen; type/; alpha 1 


6.9 


309641 AW194230 Hs.253100 


EST 


11.4 


309698 AW238461 Hs.73742 


rfoosomai protein; large; P0 


4.3 


309700 AW241170 Hs.179661 


Homo sapiens clone 24703 beta-tubulin m 


11.9 


310073 AI335004 Hs.148558 


ESTs 


U 


310094 AW450967 Hs.235240 


ESTs 


5.7 


310373 AW080778 Hs.1455B2 


ESTs 


4.8 


310438 AW022192 HS.20Q197 


ESTs 


39.1 


310470 AI281848 Hs.194691 


ESTs 


4.9 


310583 AW205632 Hs,211198 


ESTs 


7 


310877 T47784 Hs.188955 


ESTs 


4.1 


311067 AI587332 Hs.209115 


ESTs 


11.2 


311166 AI821294 Hs.1 18599 


ESTs 


24.1 


311199 T57896 Hs.191095 


EST cluster (not in UniGene) 


5.7 


311465 AI758660 Hs.206132 


ESTs 


15.7 


311587 AI828254 Hs.271019 


ESTs 


6.4 


311774 AA700870 Hs.14304 


ESTs 


6.2 


311785 AI056769 Hs.133512 


ESTs 


5 


311923 T60843 Hs.189679 


ESTs 


5.9 


311935 AA216387 


EST cluster (not in UniGene) 


5.5 


311972 N51511 Hs.188449 


ESTs 


5.2 


312014 AI435650 Hs.1 28778 


ESTs 


4.3 


312047 AA588275 Hs. 180669 


ESTs 


14.7 


312147 T89855 Hs.195648 


EST cluster (not In UniGene) 


9.8 


312153 AA759250 Hs.15302S 


cytochrome 1*561 


27.1 


312168 T92251 Hs.198882 


ESTs 


4.2 


312172 AI222168 Hs.191168 


ESTs 


6.1 


312226 AI796815 Hs.199993 


ESTs; Weakly similar to ubiquitous TPR 


5.5 


312292 AW451893 Hs.151124 


ESTs 


18.4 


312312 AI080505 Hs.134529 


ESTs 


11.9 


312369 AA582039 Hs.173884 


Homo sapiens mRNA; chromosome 1 spe 


4 


312407 R46180 Hs.153485 


ESTs 


13.6 


312430 AW139117 Hs.117494 


ESTs 


4.1 


312470 AW451347 Hs.175862 


ESTs 


4.6 


312483 AI417526 Hs.7753 


ESTs 


15.3 


312521 AA033609 Hs.319093 


ESTs 


125 


312544 AI498371 Hs.183526 


ESTs 


14.6 


312638 AW439195 Hs.256880 


ESTs 


5.3 


312754 R99834 Hs.250383 


ESTs 


8.4 


312772 H63791 


EST cluster (not in UniGene) 


4.3 


312821 AA699325 Hs.269880 


ESTs 


8.3 


312837 AW292286 Hs.255058 


ESTs 


7.1 


312849 AA846353 Hs.194054 


ESTs 


5.9 


312854 AA826713 Hs.321058 


EST cluster (not in UniGene) 


4.1 


312992 AA088446 Hs.170298 


ESTs 


7.3 


313096 AI422367 Hs.163533 


ESTs 


6.1 


313112 AA732534 Hs.269099 


ESTs 


42 


313126 AA720887 Hs.283313 


EST cluster (not in UniGene) 


18.1 


313136 N59284 Hs.288010 


ESTs 


17 


313197 A1738851 Hs.222487 


ESTs 


12.9 


313219 N74924 Hs.182099 


ESTs 


7.1 


313258 AW068358 Hs.183918 


ESTs 


13.7 


313328 AW449211 Hs.105445 


ESTs 


275 


313352 AW292127 Hs.144758 


ESTs 


9.8 


313417 AA741151 Hs.137323 


ESTs 


6.2 


313455 AW061702 Hs.98571 


ESTs 


6.9 
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313590 AA8Q4410 Hs.291677 


EST duster (not in UniGene) 


5.3 


313663 AI953261 Hs.169813 


ESTs 


7.6 


313657 U69201 Hs. 13684 


ESTs; Weakly similar to choline kinase is 


12.5 


313749 AW450376 Hs.1 19004 


ESTs 


5.5 


313832 AW271022 Hs.133294 


ESTs 


4.3 


313881 AA535580 Hs.16331 


ESTs 


7.7 


313915 AI969390 Hs.163443 


ESTs 


27.1 


313955 AI858884 Hs.270647 


ESTs 


5J 


313974 AI310151 Hs.173524 


ESTs 


4.3 


314097 AA648744 Hs.269493 


ESTs 


14.5 


314129 AA228366 Hs.1 15122 


ESTs 


9.5 


314359 AA205569 Hs.194193 


ESTs 


5.4 


314384 AA535840 Hs.162203 


ESTs; Weakly similar to alternatively spy 


5.3 


314394 AI380563 Hs.130816 


ESTs 


13.2 


314462 AA347951 Hs.326413 


ESTs 


6.2 


314465 AA602917 Hs.156974 


ESTs 


18.1 


314470 AI934422 Hs.30661 


ESTs .' 


4.2 


314488 AA358265 Hs.182890 


ESTs 


6.1 


314506 AA833655 Hs.206868 


ESTs 


27.8 


314510 A1204418 Hs.190080 


ESTs 


9.5 


314558 AI873274 Hs.190721 


ESTs 


22.5 


314661 AA436432 Hs.324239 


EST duster (not in UniGene) 


13.3 


314691 AW207206 Hs.136319 


ESTs 


21.4 


314754 AW026761 Hs.134374 


ESTs 


4.4 


314775 AI149880 Hs.188809 


ESTs 


4.4 


314943 AI476797 Hs.184572 


cell division cycle 2; G1 to S and G2 to M 


18.4 


314961 AW008061 Hs.231994 


ESTs 


10.2 


314963 AI689617 Hs.200934 


ESTs 


5.3 


315006 AI538613 Hs.298241 


ESTs 


20.7 


315010 AA531082 Hs.240049 


ESTs 


5 


315019 AA532807 Hs.105822 


ESTs 


6.1 


315033 AI493046 Hs.146133 


ESTs 


12 


315036 AA534953 Hs.1 63297 


ESTs 


8.3 


315037 AW205863 Hs.133988 


ESTs; Weakly similar to gene MAC25 pr 


6.1 


315051 AW292425 Hs.163484 


EST 


12.7 


315054 AI968598 Hs.78768 


ESTs 


7.6 


315073 AW452948 Hs.257631 


ESTs 


13.9 


315080 AA744550 Hs.136345 


ESTs 


4.4 


315083 AI221325 Hs.205442 


ESTs 


5.1 


315088 AA557351 Hs.152448 


ESTs; Moderately similar to MULTIFUN 


4.7 


315175 AI025842 Hs.152530 


ESTs 


11.9 


315195 AA972756 Hs.44898 


ESTs 


28.8 


315296 AA876905 Hs.125286 


ESTs 


16.1 


315303 AW194364 Hs.128022 


ESTs; Weakly similar to FIG-1 PROTEIN 


25.7 


315352 AA604799 Hs.136528 


ESTs; Moderately similar to 111! ALU SU 


MJ3 


315364 AA643602 Hs.155485 


ESTs; Highly similar to serine protease (H 


4.6 


315368 AW291563 Hs.104696 


ESTs 


4.8 


315390 A1801565 Hs.200113 


ESTs; Weakly similar to alternatively spll 


4.4 


315408 AW273261 Hs.216292 


ESTs 


5 


315458 AA872000 Hs.116104 


ESTs 


7.6 


315472 AA828850 Hs.165469 


ESTs 


4.9 


315478 AA665612 Hs.120874 


ESTs 


5.2 


315498 AA628539 Hs.1 16252 


ESTs; Moderately similar to U!l ALU SU 


4.8 


315527 AI791138 Hs.116768 


ESTs 


4,4 


315530 AI200852 Hs.127780 


ESTs 


22.4 


315562 AA737415 Hs.152826 


ESTs 


5.9 


315634 AA837085 Hs.220585 


ESTs 


8.8 


315647 AA648983 Hs.212911 


ESTs 


15 


315652 AI521489 Hs.3053 


ESTs 


6.3 


315676 AW002565 Hs.124660 


ESTs 


92 


315680 AA814309 Hs.123583 


ESTs 


8.1 


315735 AI831760 Hs.155111 


ESTs 


13.4 


315741 AA812168 Hs-122559 


ESTs 


5.4 


315769 AA744875 Hs.189413 


ESTs 


4.4 


315978 AA830893 Hs.1 19769 


ESTs 


10.4 


315984 AI015862 Hs.131793 


ESTs 


5 


316042 AW297979 Hs.170698 


ESTs 


14.7 
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316136 AA830808 Hs.1 24366 


ESTs 


4 


316177 AI908272 


Hs.293102 


EST cluster (not in UniGene) 


326 


316313 AA741300 Hs^Q2599 


ESTs 


4.8 


316405 AA757900 Hs.270823 


ESTs 


4.8 


316480 AI749921 


Hs.205377 


ESTs 


12.9 


316564 AI743571 


Hs.168799 


ESTs; Weakly similar to llll ALU SUBFA 


8.1 


316714 AA809792 Hs.123307 


ESTs 


5 


316715 AI440266 


Hs.170673 


ESTs 


4.2 


316828 AA828116 Hs.173076 


ESTs 


5.2 


316869 A1954880 


Hs.134604 


ESTs 


13.3 


316905 AW138241 Hs.210846 


ESTs 


6.2 


316943 AW014875 Hs.137C07 


ESTs 


5.3 


316949 AA856749 Hs.124620 


ESTs 


7.2 


317008 AW051597 Hs.143707 


ESTs 


4.1 


317028 AA962623 


Hs.189144 


ESTs; Weakly similar to RENAL SODIU 


4.2 


317067 AI805392 


Hs.325335 


ESTs 


4.5 


317069 AI732892 


Hs.190489 


ESTs 


6.4 


317210 AA490718 




EST duster (not In UniGene) 


4.4 


317298 A1922374 


Hs.1 58549 


ESTs 


5.9 


317658 AW139077 Hs.202217 


ESTs 


4.6 


317674 AW294909 Hs.132208 


ESTs 


5.2 


317685 AI798630 


Hs.149997 


ESTs 


4.3 


317836 AA983913 Hs.128929 


ESTs 


12.4 


317881 AI827248 


Hs.224398 


ESTs 


12.1 


317902 AI828602 


Hs.211265 


ESTs 


8.8 


317916 AI565071 


Hs.159983 


ESTs 


12.6 


318042 AW294522 Hs. 149991 


ESTs 


5.6 


318053 AI074465 


Hs.133469 


ESTs 


4 


318064 AW296888 Hs.170939 


ESTs 


5.2 


318070 AJ024594 


Hs.248942 


ESTs 


4.7 


316073 AW167067 Hs.131562 


ESTs 


15.7 


318146 AI040125 


Hs.150521 


ESTs 


5.9 


318186 AW016773 Hs.3709 


ESTs 


5.3 


318481 AI291584 


Hs.145921 


ESTs; Weakly similar to HYPOTHET1CA 


7.6 


318566 AI335361 


Hs.226376 


ESTs 


5.8 


318617 AW247252 Hs.75514 


nucleoside phosphorylase 


11.1 


318662 AI285898 


Hs.294014 


ESTs 


16.3 


318691 AW192139 Hs.181307 


H3 histone; family 3A 


4 


318740 NM_002543Hs.77729 


EST duster (not in UniGene) 


21.3 


318744 AI793124 


Hs.144479 


ESTs 


35 


318948 AA317274 Hs.13996 


ESTs 


11.7 


319163 F15257 


Hs27 


glycine dehydrogenase (decarboxylating; 


7 


319478 R06841 


Hs.270307 


EST duster (not in UniGene) 


8.9 


319545 R83716 


Hs.14355 


ESTs 


8.2 


319668 NMJ)02731Hs.87773 


EST duster (not in UniGene) 


25.4 


319763 AA460775 


Hs.6295 


ESTs 


7 


319913 AA179304 


Hs.271586 


ESTs; Moderately simitar to ill) ALU SU 


8.7 


319936 W22152 


Hs.282929 


EST duster (not in UniGene) 


5.6 


319951 AA307665 


Hs.14559 


ESTs 


43 


319962 H06350 


Hs.135056 


ESTs 


9.2 


319977 AA632632 




EST duster (not in UniGene) 


4.6 


320074 AA321166 


Hs.278233 


EST cluster (not in UniGene) 


16.7 


320092 AF022799 


Hs.1 13292 


calpain 9 (nCL-4) 


5.4 


320107 AA836461 


Hs.291712 


EST duster (not in UniGene) 


5.3 


320133 063271 




EST duster (not in UniGene) 


5.5 


320167 AA984373 


Hs.90790 


EST duster (not in UniGene) 


15 


320187 T99949 


Hs.303428 


EST duster (not in UniGene) 


6.7 


320211 AL039402 


Hs.125783 


DEME-6 protein 


24.3 


320401 U90449 


Hs.152717 


nudeoside diphosphate kinase type 6 (inn 


10 


320456 AI884396 


Hs.24131 


ESTs 


5.4 


320488 R31386 


Hs.191791 


EST duster (not (n UniGene) 


4.9 


320521 N31464 


Hs.24743 


ESTs 


9.5 


320661 AA864846 


Hs.1 15175 


EST duster (not in UniGene) 


6.6 


320691 R61576 


Hs.313951 


hypothetical protein 


5.9 


320699 R63161 


Hs.1 18249 


EST duster (not in UniGene) 


4 


320727 U96044 


Hs.181125 


EST duster (not in UniGene) 


45.3 


320993 AL050145 


Hs.225986 


Homo sapiens mRNA; cONA DKFZp586 


7.2 
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321012 AA737314 

321050 AW393497 

321051 AF134149 
321171 A1769410 
321192 AA295304 
321354 AA078493 
321337 H66014 
321412 AW366305 
321489 AW392474 
321539 N93619 
321593 H84762 
321665 028390 
321891 AW157424 
321910 H67065 
321953 AW068268 
321978 N77342 
322017 AA310039 
322026 AA233527 
322035 AL137517 
322171 AF085968 
322175 AF085975 
322236 AL134970 
322303 W07459 
322735 AA086123 
322777 M679082 
322818 AW043782 
322882 AW248508 
322975 C16391 
322991 C18965 
323011 AA580288 
323091 AW014094 
323107 AI301107 
323136 AL120351 
323168 AL120862 
323195 AI064982 
323201 AL049370 
323203 AA203135 

323243 W44372 

323244 T70731 
323328 AA228078 

323332 A1829520 

323333 AA2288B3 
323570 AL038623 
323604 AI751438 
323685 AA344205 
323753 AA327102 
323817 AA410943 
323845 AI684674 
323930 AA570698 
323997 AA844907 
324047 AA378201 
324261 AL044891 
324302 AA543008 
324338 AL138357 
324344 AW502000 
324432 AA464510 
324495 AW501411 
324497 AW152624 
324598 AA502659 
324603 AW016378 
324620 AA448021 
324727 AI610425 
324774 AI031771 
324783 AA640770 
324824 A1826999 
324826 AA704806 
324902 031323 



Hs.194324 EST cluster {not In UniGene) 
EST cluster (not in UniGene) 

Hs.240395 EST cluster (not in UniGene) 

Hs.221461 ESTs 

Hs.297939 ESTs; Weakly similar to neogenln. Jftsap 

EST cluster (not in UniGene) 

Hs. 141278 ESTs; Weakly similar to II!! ALU SUBFA 

Hs.22891 EST cluster (not In UniGene) 

Hs.172759 ESTs; Moderately similar to III! ALU SU 

Hs.42915 ARP2 (actin-related protein 2; yeast) horn 

Hs.253197 ESTs 

Hs,272897 EST cluster (notin UniGene) 

Hs.165954 ESTs 

Hs.271 530 ESTs; Weakly similar to III! ALU SUBFA 

Hs.292833 ESTs; Weakly similar to !!H ALU CLASS 

Hs.21851 EST duster (not in UniGene) 

Hs.9192 ESTs 

Hs.283675 low density lipoprotein receptor (tamilial 

Hs.306201 EST duster (not in UniGene) 

Hs.48474 EST duster (not in UniGene) 
EST duster (not in UniGene) 

Hs.104222 folfetaMke 1 

Hs.157601 EST duster (not in UniGene) 

Hs.297856 EST duster (not in UniGene) 

Hs.269947 ESTs 

Hs.293616 ESTs 

Hs.279727 DiGeorge syndrome critical region gene 2 

EST duster (not In UniGene) 

Hs.159473 ESTs 

EST duster (not in UniGene) 

Hs.210761 ESTs 

Hs.150790 ESTs 

Hs.30177 EST duster (notin UniGene) 

Hs.124165 ESTs 

Hs.1 1 7950 multifunctional polypeptWe similar to SA 

Hs.13350 Homo sapiens mRNA; cDNA DKFZp586 

Hs.130186 ESTs 

Hs.110771 EST duster (not in UniGene) 

Hs.193620 EST duster (not in UniGene) 

Hs.255096 EST duster (not in UniGene) 

Hs.227513 ESTs 

Hs.208558 EST duster (not In UniGene) 

Hs.208752 ESTs; Weakly similar to !!!! ALU SUBFA 

Hs.41271 ESTs; Weakly similar to 111! ALU SUBFA 

Hs.289088 EST duster (not in UniGene) 

Hs,70266 EST duster (not in UniGene) 
EST duster (not in UniGene) 

Hs.411 27 ESTs; Weakly similar to wadaw [O^nelan 

Hs.8173 ESTs 

Hs.274454 EST duster (not in UniGene) 

Hs,271340 ESTdusterfnotin UniGene) 

Hs.269350 EST duster (not in UniGene) 

Hs.292471 ESTs; Weakly similar to UD ALU SUBFA 

Hs.145078 ESTs 

Hs.46677 EST duster (not in UniGene) 

Hs.152812 EST duster (not in UniGene) 

Hs.122489 ESTs; Weakly similar to 111! ALU CLASS 

Hs.136340 ESTs 

Hs.163986 ESTs 

Hs.292934 ESTs 

Hs,94109 EST duster (not in UniGene) 

Hs.19597 ESTs 

Hs.132586 ESTs 

Hs.200994 EST duster (not in UniGene) 

Hs.224624 ESTs 

Hs.143842 ESTs 

Hs.271492 ESTs 
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324961 AA6 13792 




EST duster (not in UniGene) 


13.3 


324987 T06882 


Hs. 172634 


ESTs 


19.6 


3249Q8 T06997 


HS. 121028 


EST cluster (not In UniGene) 


24.5 


325146 AI064690 


Hs.171176 


ESTs 


4.6 


325622 




, CH.14_hsgil5867000 


5.2 


326213 




CH.17_hs gi|5867224 


8.1 


326474 




CH.19_hs giJ5867405 


12.7 


326816 




CR20_hs gi|6552458 


9.4 


326617 




CH.20_hs gij6552458 


11.7 


327110 




CH.2Lhsglj6117842 


14.7 


327196 




CKOIJhs glJ5867446 


5.1 


327283 




CH.01_hsgi|5867478 


4.3 


327313 




CH.01Jisgi|5867501 


4.8 


327450 




CH.02_hsgi|5867766 


4.1 


328059 




CH.06_hsgi|6117819 


6.2 


328304 




CK07_hs gi[6004478 


5.4 


328492 




CH.07_hsgiJ5868455 


7 


328857 




CH.07Jis gi|6381927 


5.2 


329367 




CRX_hsgi|5868842 


7.6 


329373 




CH.X_hsgi|6682537 


12 


329655 




CH.14_p2gi{6448516 


4 


329899 




CH.15_p2 gil6563505 


4 


329960 




Cai6_p2gij5091594 


7.6 


330084 




CH.19_p2gi|60153O2 


4 


330384 M23263 




androgen receptor (dihydrotestosterone re 


5.8 


330385 AA449749 




ESTs; Highly similar to secreted apoptosi 


10.2 


330387 H14624 




ESTs; Highty similar to secreted apoptosi 


4.4 


330388 X03363 




HER2 receptor tyrosine kinase (c-erbB-2; 


17.7 


330409 D50692 


Hs.78221 


c-myc binding protein 


10.1 


330460 TIGR:HT544 


Hs.73946 


Endc 


330486 M13755 


Hs.833 


interferonsfjmulated protein; 15 kDa 


67 


330494 M29696 


Hs.237868 


interieukin 7 receptor 


6 


330500 M34423 


Hs.79222 


galactosidase; beta 1 


13.1 


330510 M75099 


Hs.227729 


FK506-binding protein 2 (131(D) 


29 


330513 M81Q57 


Hs.180884 


carboxypeptidase B1 (tissue) 


38,5 


330541 U22970 


Hs.265827 


muiOple UniGene matches 


7.4 


330542 U23942 


Hs.226213 


cytochrome P450; 51 (lanosterol 14-aipha 


15 


330547 U32989 


Hs.183671 


tryptophan 2;3-dioxygenase 


11 


330551 U39840 


Hs.299867 


hepatocyte nuclear factor 3; alpha 


6.5 


330562 U49082 


Hs.76460 


transporter protein 


7.7 


330573 U62800 


Hs.83393 


cystatin E/M 


4 


330673 D57823 


Hs.321403 


Sec23 (S. cerevisiae) homoJog A 


10.5 


330711 AA164687 


Hs.177576 


mannosyl (alpha-1;3-)-glycoprotein beta-1 


24.3 


330814 AA015730 


Hs.265398 


ESTs; Weakly simiiarto trans/ormation-r 


44.1 


330850 AA075298 


Hs.322710 


ESTs 


4.4 


330874 AA127474 


HS,191157 


ESTs; Weakly similar to IHl ALU SUBFA 


8.1 


330884 AA133457 


Hs.102548 


ESTs 


5.2 


330912 AA195936 


Hs.82719 


genera! transcription (actor IIA; 1 (37kD a 


5 


330924 AA232136 


Hs.159737 


Homo sapiens mRNA; cDNA DKFZp434 


9.1 


330997 H55762 


Hs.9302 


ESTs 


7.6 


331014 H98597 


Hs.30340 


ESTs 


13.5 


331024 N32919 


Hs.27931 


ESTs 


9.1 


331046 N66563 


Hs.191358 


ESTs 


10.5 


331135 R61398 


Hs.4197 


ESTs 


7.4 


331145 R72427 


Hs.129873 


ESTs; Weakly simiiarto CYTOCHROME 


41.9 


331146 R73816 


Hs.17385 


ESTs 


4.7 


331222 T98531 


Hs.173904 


ESTs 


4.1 


331230 W69807 


H3.16537 


hypothetical protein; similar to (U06944) 


4.9 


331306 AA252079 


Hs.63931 


dachshund (Drosophila) homotog 


15.1 


331327 AA281076 


Hs.109221 


ESTs 


4.8 


331337 AA287662 


Hs.50495 


ESTs 


7.6 


331341 AA303125 


Hs.23240 


ESTs; Weakly similar to Hi! ALU SUBFA 


13 


331344 AA357927 


Hs.126550 


ESTs 


12.4 


331362 M417956 


Hs.40782 


ESTs 


6.5 


331363 AA421562 


Hs.91011 


anterior gradient 2 (Xenepus laevis) homo 


28.2 


331376 AA443802 


Hs.41007 


ESTs; Weakly similar to cDNA EST yk47 


15.1 


331384 AA456001 


Hs.93847 


ESTs 


7.9 
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331478 N26608 


Hs.40639 


331526 N49967 


Hs.46624 


331533 N51517 


Hs.47282 


331681 W85712 


Hs.1 19571 


331686 W88502 


Hs.182258 


331750 AA284372 


Hs.111471 


331751 AA284840 


Hs.143818 


331760 AA292721 


Hs.154434 


331763 AA312861 


Hs.96704 


331825 AA411144 


Hs.292882 


331890 AA432166 


Hs.3577 


331952 AA454756 


Hs.97837 


332015 AA487910 


Hs.208800 


332043 AA490831 


Hs.125056 


332060 AA504779 


Hs.191402 


332071 AA598594 


Hs.205293 


332093 AA608794 


Hs.112592 


332139 AA620669 


Hs.1 12879 


332219 N22508 


Hs.139315 


332225 N33213 


Hs.100425 


332246 N57927 


Hs.120777 


332247 N58172 


Hs.109370 


332260 N70088 


Hs.138467 


332269 N91279 


Hs.109654 


332336 T96130 


Hs.137551 


332340 W15495 


Hs.129781 


332347 W60326 


Hs.288684 


332362 W93640 


Hs.4779 


332467 AA489630 


Hs.119004 


332499 M12036 


Hs.323910 


332513 AA018182 


Hs.154424 


332526 AA281753 


Hs.77515 


332532 N63192 


Hs.1892 


332565 AA234896 


Hs.25272 


332607 R41791 


Hs.36566 


332640 AA417152 


Hs.5101 


332694 AA262768 


Hs.243901 


332702 H93968 


Hs.75725 


332705 T59161 


Hs.76293 


332749 AA479968 


Hs.88251 


332927 




332929 




332930 




332955 




332958 




332961 




332933 




333009 




333010 




333013 




333108 




333139 




333254 




333305 




333343 




333388 




333456 




333459 




333517 




333585 




333679 




333743 




333758 




333767 




333768 




333769 




333795 





ESTs 
ESTs 
ESTs 

collagen; type III; alpha 1 (Ehlers-Oanfos 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to unknown pisap 

ESTs 

ESTs 

succinate dehydrogenase complex; subunl 
ESTs 

ESTs; Weakly similarto Ml ALU CLASS 

ESTs 

ESTs 

ESTs 

ESTs 

EST 

ESTs 

ESTs 

ESTs; Weakly slmflarto RNA POIYME 

ESTs 

ESTs 

ESTs; Moderately similar to outer membr 
ESTs 

chromosome 21 open reading frame 5 
ESTs 

ESTs; Moderately similar to similar to AD 

KIAA0665 gene product 

Human tyrosine kinase-type receptor (HE , 

deiodinase; lodothyronine; type IJ 

Inositol 1;4;5-triphosphate receptor; type 

EST; Highly simiiarto PHENYLETHAN 

E1A binding protein p300 

LIM domain kinase 1 

protein regulator of cytokinesis 1 

KIAA1067 protein 

transgeBn2 

thymosin; beta 10 

arylsulfataseA 

CH2i_FGENES.38J 

CH22LFGENES.38 3 

CH2i.FGENES.38.4 

CH22.FGENES.48J2 

CH22_FGENES.48J5 

CH2i_FGENES,48J8 

CH2i.FGENES.54_5 

CH2i.FGENES.61J 

CH23JGENES.6L2 

CH22J=GENES.6L5 

CH2i.FGENES.79J4 

CH2i.FGENES.83J6 

CH2i.FGENES.118 2 

CH22_FGENES.137_2 

CH2i.FGENES.139 12 

CH2i.FGENES.144 3 

CH22J=GENES.157_5 

CH22_FGENES.157_8 

CH2^FGENES.173_2 

CH22_FGENES.203J 

CH22_FGENES.247 6 

CH22.FGENES.264 1 

CH22JH3ENES.268 1 

CH22.fGENES.27L6 

CH22LFGENES.271J 

CH22.JGENES.271J 

CH2ifGENES^75J 



243 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



333796 


CH22_FGENES.275 3 


6.8 


333892 


CH22 JGENES.292J4 


4.4 


333904 


CH2i.FGENES.294_2 


6.5 


333905 


CH2i.FGENES.294 3 


9.3 


333921 


CH22_FGENE&296j2 


9.6 


333968 


CH22_FGENES.307_4 


15.9 


334102 


CH2i.FGENES.327_60 


7.1 


334222 


CH2i.FGENES.360 3 


6.7 


334223 


CH2i.FGENES.360 4 


33.5 


334264 


CH2i.FGENES.367 15 


18.5 


334343 


CH2i_FGENES.375 25 


6.1 


334360 


CH22.FGENES.378 5 


6.1 


334784 


CH2i_FGENES.43i_9 


4.8 


334789 


CH2i.FGENES.432 14 


5.1 


334794 


CH22 FGENES.434 2 


7 


334889 


CH22J=GENES.452 3 


12.4 


335004 


CH2i.FGENES.472 8 


73 


335115 


CH2i.FGENES.496J2 


183 


335287 


CH22J : GENES.526J1 


4.5 


335342 


CH22JFGENES.536 1 


5.3 


335491 


CH2i.FGENES.570]23 


24 


335495 


CH22_FGENES,570 28 


7 


335498 


CH22_FGENES.57U 


12.2 


335544 


CH22~FGENES.576 5 


8.4 


335610 


CH22_FGENES.5S3._4 


12.9 


335653 


CH2i.FGENES.590_4 


6.7 


335682 


CH2i.FGENES.595_2 


12.1 


335687 


CH22 FGENES.596.2 


13.9 


335755 


CH22IFGENES.604J 


115 


335782 


CH2i.FGENES.609 4 


17.9 


335791 


CH22_FGENES.611 7 


27.3 


335809 


CH2i.FGENES.617 6 


19.2 


335822 


CH2i.FGENES.619 7 


19.1 


335823 


CH2i.FGENES.619 8 


4.5 


335824 


CH22 FGENES.619 11 


40.2 


335825 


CH22 FGENES.619 12 


343 


335895 


CH22_FGENES.635_3 


10.2 


335917 


CH22_FGENES.636J3 


6 


335920 


CH22J=GENES.636J6 


8.8 


336035 


CH22 FGENES.678 6 


5.9 


336042 


CH2i/GENES.679 4 


5.8 


336093 


CH22 FGENES.691 2 


11.6 


336096 


CH22J=GENES.69L5 


7.6 


336150 


CH22 FGENES.706 6 


6.3 


336152 


CH22J=GENES.706..9 


10.5 


336416 


CH22 FGENES.823 38 


5 


336444 


CH22_FGENES.827 10 


4.8 


336449 


CH22_FGENES.829J 


13.6 


336471 


CH2i/GENES.829J0 


6.9 


336512 


CH22_FGENES.834_7 


21.4 


336558 


CH2i_FGENES.842_3 


8.2 


336560 


CH2i/GENES.84i.5 


9 


336676 


CH2iJGENES.43-4 


9.4 


336959 


CH22_FGENES.367-13 


19 


337968 


CH22.EMAC005500.GENSCAN.103-2 


13.4 


338008 


CH22_EM:ACQ05500.GENSCAN,127-9 


15.2 


338057 


CH2i.EM:ACQ05500.GENSCAN.160-1 


13.9 


338410 


CH2i.EMAC005500.GENSCAN.341-6 


8 


338451 


CH2i.EMAC005500.GENSCAN.35W 


11.6 


338588 


CH2i.EM:AC005500.GENSCAN.432-1 


10.3 


338665 


CH2i.EM:AC005500.GENSCAN.464-2 


4.8 


338689 


CH2i_EMAC005500.GENSCAN.475<J 


6.7 


338832 


CH2i.DJ246D7.GENSCAN.6-9 


4.8 


338980 
339352 


CH2i.OA59H18.GENSCAN.2-4 


5.1 


CH2i.BA354l12.GENSCAN.29-7 


6.9 


339373 


CH2i.BA232E17.GENSCAN.1-29 


4.3 
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TABLE 13A 

Table 13 A shows the accession numbers for those pkeys lacking unigenelD's for Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

123619 371681.1 AA602964 AA609200 
20 103207 30635_-4 X72790 
103349 11052.-2 X89059 

110856 19346J4 AA992380 N33063 N21418 H79958 R21911 H79957 
113248 328626J T63857 AW971220AA493469T63699 

123169 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
^ AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 M884922 BE328517 

AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 A1689062 AA282915 AW102898 AI872193 AI763273 A Wi 73586 AW150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
- A BE612881 AW276997 AW513601 AW512843AA044209 AW856538AA1 80009 AA337499 AW961 101 AA251669AA251874 

5 U AI819225 AW205862 AJ683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 

AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 A1373032 AJ564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
123533 genbank.AA608751 AA608751 
116480 genbank_C14088 C14088 
35 132225 genbank_AA128980 AA1 28980 

125154 genbanRJV38419 W38419 ' 
118475 genbanKJl66845 N66845 

102919 25180.2 M21191 AL035748 AA021266AA323126 AA180515 AI613029 D28356 NM 000034 M1 1560 AW401 425 AW248248 AA0 12956 
An AA323294 W04965 H38759 AA206622 AA580747 AI541 172 AA381075 AA354229 AW402353 AW405575 AW404021 AW406207 

^ u AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515 AW401583 AA331357 

AW402140 AW249079 R31488 AA075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431005 AA306726 R33804 
AA216544 AW275288 AA227044 AL038124 AA243300 C03242 AA315615 AL035840 R64336 AA313917 AA018963 AA00 1335 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 AA348257 
, - AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640 AA085425 AA461708 

^ D AA313637 AL039229 H84490 H86153 F00656 AA326668 AA347304 R65890 H41 949 AA339309 AW402002 AW404854 

AA192582 AA1 12802 H09248 N83165 H38367 AA356339 AA455763 R66853 AA294935 H8591 1 AA310414 H93436 N87014 
AA001186 H83640 AA411328 AA317929 C04192AW406288 U46335 AA323179 AA427649 AI366131 H14328 AA197161 
AA379497 AA31 1816 AA017206 AA001137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355248 AA374921 
<- n AW380419 AA345864 AA318058 AA371711 AA363255 AA057094 R88057 AA394045 AW362741 AA479579 AW362789 

-> U AW362775 AA223624 AW362809 AW362746 AW362753 AW380412 AA0571Q4 AA295271 X06352 AW362771 X12447 R59553 

R87925 AA292864 R83920 F01 120 AA496740 AA852551 AL039209 AA093339 AA333961 N86416 C03325 C03831 AA194750 
AA359270 AA359460 AW245492 AA339400 AA330784 AI908970 N89223 AW361 938 AI940063 AA134323 AA371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 AI074079 F291 18 AA852940 F35696 AA345963 
<- . %SI^*™ 13785 002989 M095945 003145 005199 AA346024 AA190506 AW361659 AI909845 AW374374 AW374382 

^ AW374401 AW374373AW374370A1909831 AW374367 AA353658 F01041 C02843AA375948AW374414AA213946 AJ525039 

H13744 R31007 AA1 12044 AA134404 H47935 AW177018 AA429768 AA336873 AA1 12875 H46393 AA191267 059131 
AW406037 AA055244 AA341880 AA179024 AA308537 AW406985 AW327311 H30301 AA300705 H43788 AW364149 
AA806213 AA481936 C04941 AW375299 M21190 AA410818 AA250940 AB54547 AA317422 AA250903 A1865497 AA890603 
, n AW498538 M ™*2 AW406461 C03092 F00308 H56488 AA336320 AW406501 AA354102 AA382942 AA096393 

°U AW376830 AA383446 F01259 AW081388 T94036 AA379643 H43842 AI524063 AA621727 AA379099 AA371417 R6681 1 

H44129 N84794 F01 135 AA477852 AA293062 AW361595 H27194 AI831650 H43253 H24797 AI564680 AA380090 W20057 
AI921586 A1192549 AW090808 H25967 AA918121 AI626060 H20221 AA812572 H42178 AA887222 H96000 C03180 F00946 
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C03986 AI318091 AI860172 AA582179 AI633388 AA557193 R68075 F241Q5 AW518239 W56622 AI625219 AI925243 
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA159956 
AA610836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 AA010455 
AA563928 AI571596 AI128394 W73707 A1423575 AA583809 AA657988 AI950837 AA169782 M600009 AI885540 AA771884 
5 AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW070909 AI9Q5695 AA4801 15 AA574051 

AI889185 AA773167 AA331375 AA001437 AA194324 AA194300 AA558632 AI038538 AA41 1329 AA781570 AI833176 
AA935520 AW074197 AA583063 AW073099 AW001 198 AA948025 AA587857 AA191540 AI460085 AA193244 AI538037 
AA515572 AA758587 AI149311 AA508610 AA206409 AA534004 AA994600 AA827543 AI916349 AW245129 AW517804 D25663 
AA781985 AA284536 A1819422 H16040 H27531 AA456564 AA845555 AI423596 AA012908 AA889439 AA71631 1 AA968868 

1 0 AA320508 AA725731 AA834202 AA935997 AA724815 AA769353 AA594803 F00827 AI342442 A1003519 AI002503 AI347597 

AI04O946 AA197162 AA987883 AA292865 AW001944 AI64071 1 AW244044 AA456784 F30588 AA290829 H24754 AI978683 
AA483686 AA583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 AA226934 AA716204 
AW513025 AA628543 AA583705 F25702 AI368748 AI124097 AI880086 AA477513 AI758834 A1690753 M477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25054 AI569314 AA961665 AI922050 AI759000 AA555236 AA514432 

1 5 AA293474 AA001129 AA826789 AA641390 AA134405 F35585 AA477416 AW193359 AI36131 5 AA284988 F36340 AI361322 

F26989 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA190992 AA577139 AA865535 AA1 34324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 M654007 
AI830942 AA1 13195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 M054148 AA857063 AA017259 

20 AA179789AA088908H43704AA194320 F35950A1880127F20441 F32878AA962483H39094H56489 H44621 F19390 

A1302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AA079530 H28106 AA243301 
AI025737 AA101239 AA088887 AI214910 AA974888 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 AI811549 AA665180 AA708200 F01124 F32382 AA346220 AA627361 F30741 F30010 F28543 AA211715 F20245 
AA331222 F25634 F21996 W28215 F2191 1 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 

25 AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F28411 

H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27208 F18418 F31459 F18564 F33496 F16376 F29740 F29843 F29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 AI926215 AA464185 F18217 N26193 AL043256 T41197 F33055 F00386 F29500 F34191 F33297 
AI937207 F22724 F15909 F26232 F18889 AA318627 F29085 AA872104 F17509 F23373 F15660 F17552 F17412 F16863 

30 F34033 F21515 F17364 F18383 F16546 F17561 F 1 7260 AA292000F 15723 T47438 F16798 F18046 F18319 F17978 F17566 

F34230 F33258 F20860 F17998 AI695701 
118600 genbanK_N69222 N69222 
118952 genbank_N92966 N92966 
120873 genbankJ\A358015 AA358015 

35 113702 genbank__T973Q7 T97307 

129982 221.267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 214166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF0621 40 L01278 
AF062134 AF062139 X81723 Z8084O X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 

40 AF062177 AF062222AF062115Z47240AF062263AF062261 AF062223 AF062211 Z47238 AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 214171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 UQ0570 AI268604 Y15773 X64239 X62969 U00508 

45 • X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X6308O 

D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF1 15130 AF1 151 17 Z92896 U21254 Z92895 AF1 151 12 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF115109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 

50 Z96957 AF021986 Z92898 Z96956 134164 AF062251 AF052524 U00549 U00541 083687 AF052522 AF087420 AF052520 

AF02201 1 L43085 U00536 119915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S561 84 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 104342 L03818 103B17 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 

55 AF017458 AJQ08207 AJ008183 AJ008196 AJ008241 AJ008208 AF1 03210 AF068668 AF068670 AF068671 AF068664 

AF068669 AF068656 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NIVL002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
A1857980 AW368899 AI905B33 AW406586 AA482084 A1872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 M715498 AA295885 T27613 T98113 

60 115763 genbanK_AA421560 AA421560 
124357 genbanLN22401 . N22401 
108733 504187J AA121022AA126422 
101544 entfez_M31169 M31169 
124447 genbanR_N48000 N4800O 

65 124677 genbank_R01073 R01073 
124777 genbanK_R41933 R41933 
119302 genbankJ-25725 T25725 
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103680 entrez Z93784Z93784 
135029 H58818_at H58818 
112253 genbank_R51818 R51818 
Pkey CAT number Accession 
5 322175 46877J AF085975 H53458 H53459 

323011 139750J AA580288 AA31 5655 Ml 33031 AA377748 
322975 1510563J C16391 C15413 

317210 211994J AW881145 AA490718 M85637AA304575 T06067AA331991 

323817 233566J AA410943 AW948953AA334202AA332882 
10 309583 1046029,-2 AW170035 

324961 376239J AA61 3792 AW1 82329 T05304 AW858385 

303502 325168 J BE174240AA488528 AL042253 

320133 447553,1 BE151746 BE336853 D63271 T94955 AA774994 

311935 174129J AA216387 T63548AA228676 
15 321050 502195 1 C05928 AW393497 

319977 345248J AA534222AA632632 T81234 

312772 4380_7 AW962299 AA310349 AW962294 H63791 H63751 

321354 116028_-2 AA078493 

336512 CH22.3941FG 834 7JUNK_OJ 
20 336558 CH22_3992FG_842_3 LINK.DJ 

336560 CH22J994FG_842_5_LINK_DJ 

329367 cjLhs 

329373 cjehs 

336676 CH22_4154FG_43jL 
25 338008 CH22_6490FG_UNlCB^ACOO 

338057 CH22^6558FG_UNK^EM:AC00 

329855 c14j>2 

336959 CH22_4764FG_367J3_ 

329899 c15j)2 
30 329960 c16j>2 

338410 CH2Z.7067FG_LINI1.EM.ACOO 

338451 CH22_7124FG_LIN^EM:AC00 

338588 CH22_7331FG_L1NK.EMAC00 

338665 CH22_7438FG_UNK_EJVtAC00 
35 338689 CH22_7464FG_UNK_EM^C00 

308677 AI761173 

338832 CH22J678FG_UNK_DJ246D7 
338980 CH22L7859FG_JJNKJ>A59H18 
333009 CH22_233FG_61_1JJNK_EM:A 
40 333010 CH22_234FG_61JJJNK_EMA 
333013 CH22_237FG,6L5_LINK_EMA 
308981 AI873242 
308995 AI880172 

333108 CH22 336FG 79_14 UNK_EM: 
45 333139 CH22_368FG_83J6 UNK^EM; 

333254 CH22_495FGJ18J2„LlNieEM: 

333305 CH22_550FG J 37_2JJNK_EM: 

333343 CH22J89FGJ39_12J.INK_EM 

333388 CH22__634FG_144_3 UNK_EM: 
50 326213 c17Jis 

333456 CH22_706FG_157_5JJNKJEM: 

333459 CH22_709FG_1 57_8_UNK^EM: 

333517 CH22_773FG_173_2JJNK_EM: 

333585 CH22_B46FG_203_4_LINK^EM: 
55 333679 CH22_941FG_247_6JJNK_EM: 

326474 c19_hs 

333743 CH22_1009FG_264JJJNK_EM 
333758 CH22J024FG_268J_LINKJEM 
333767 CH22_1034FG_271 6JJNKJEM 
60 333768 CH22_1035FG_271_7J-INK_EM 
333769 CH22J036FG 271 8 LINK_EM 

333795 CH22J063FGJ275 1_LINK_EM 

333796 CH22J065FGJ275_3JJNll_EM 
335004 CH22_2326FG_472_8JJNK^EM 

65 333892 CH22J163FG_29*J4 UNK_E 
335115 CH2£_2447FG_496JJJNK_EM 
333904 CH22_1176FG__294J2 UNK.EM 
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333905 CH22_1177FG_294_3_LINK_EM 
333921 CH2ZJ194FG.296 12 LINK_E 
333968 CH22_1245FG_307_4_UNK_EM 
328059 c_6_hs 
5 335287 CH22_2629FG_526J1 LINK_E 

326816 c20_hs 

326817 c20_hs 

335342 CH22_2689FG_536_1_LiNK_EM 
335491 CH22_2843FG 570_23_UNK_E 
10 335495 CH22J847FG 570_28_LINK_E 
335498 CH22.2850FG_571_7_LINK.EM 
328304 cJJ\s 
305453 AA738110 

335544 CH22_2899FG_576_5_UNK_EM 
15 335610 CH22_2969FG_583_4J.INK.EM 
335653 CH22_3013FG_590_4_UNK_EM 
335682 CH22JQ43FG_595_2_L!NK_EM 
335687 CH2*_3048FGJ96_2JJNK_EM 
328492 c_7_hs 
20 335755 CH2^3122FG_604_4J_INK_EM 
335782 CH22_3151FG.609_4_UNK_EM 
335791 CH22_3160FG_611_7JLINK_EM 
335809 CH22.3181FG_617_6_UNK.EM 
335822 CH22_3195FG_619_7_UNK_EM 
25 335823 CH22.3196FG_619_8.UNK.EM 

335824 CH22.3197FG.619J1 UNK_E 

335825 CH22_3198FG 619 12ILINK..E 
335895 CH22_3272FG_635 3 LINK_EM 
335917 CH22.3294FG 636 13 LINK.E 

30 335920 CH22.3297FG 636 16 LINK E 

305898 AA872838 

305913 AA876109 

305950 AA884479 

328857 c7_hs 
35 330084 c19_p2 

337968 CH2^6419FG_UNK_EM:AC00 

309177 AI951118 

309198 AI955915 

309226 AI969897 
40 339352 CH22_8317FG_UNK_BA354I1 

309279 A1990102 

339373 CH22_8348FG_JJNK_BA232E1 
325622 c14_hs 

334102 CH22_1380FG_327_60_LINK_E 
45 332927 CH22_148FG_38_1_UNK_C20H 

332929 CH22J50FG_38_3_UNK_C20H 

332930 CH22J51FG 38.4_LlNK.C20H 
332955 CH2Z.179FG_48_12_UNK.EM: 
332958 CH22.182FG 48_15_UNK_EM: 

50 332961 CH22J85FG 48_18.UNK.EM: 
332983 CH22_207FG 54_5JJNK_EM:A 

334222 CH22_1506FG 360 3 UNK.EM 

334223 CH22 1507FG_360 4JJNK.EM 
334264 CH22_1551FG_367_15_UNK_E 

55 327110 c21_hs 

334343 CH22_1636FG_375_25_UNK_E 

334360 CH22_1654FG_378_5_UNK_EM 

327196 c.1_hs 

327283 cj.hs 
60 327313 c.1_hs 

304465 AA421948 

304507 AA456426 

327450 c_2_hs 

304591 AA505702 
65 304601 AA507875 

304659 AA5331S5 

334784 CH22.2096FG_432_9_LINK.EM 
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334789 CH22_2101FGJ32_14_UNK_E 
334794 CH22J106FG 434_2_LINK_EM 
336035 CH22J420FG 678_6_UNK_DJ 
336042 CH22J427FG.679 4 UNK.DJ 
336093 CH22J481FG 691 2 UNKJU 
336095 CH22J484FG 691 JJJNK.DJ 
334889 CH22_2206FG_452_3_LINK_EM 
336150 CH22_3540FG_706 6 LINK.DA 
336152 CH22_3543FG_706 9JJNK DA 
336416 CH22J833FG 823_38_UNK_B 
336444 CH22_3864FG_827_10_UNKJ) 
336449 CH22_3870FG_829_6_UNK_DJ 
336471 CH22_3894FG_829_30JJNKJ> 
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TABLE 13B 



Table 13B shows the genomic positioning for those pkeys lacking imigene ID's and 
accession numbers in Table 13B. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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15 



20 



25 



30 



35 
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45 



50 



55 



60 



Pkey: 
Ref; 

Strand: 
Ntposition: 



Pkey Ref 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et at' refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al. t Nature (1999) 402:489-495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



Strand Nt_positlon 



332955 Dunham, Utal Plus 
332958 Dunham, I. etal. Plus 
332961 Dunham, I. etal. Pius 
333139 Dunham, I. etal. Plus 
333254 Dunham, I. etal. Plus 
333305 Dunham, I. etal. Plus 
333386 Dunham, I. etal Plus 
333517 Dunham, i. etal. Plus 
333585 Dunham, I. etal. Plus 
333679 Dunham, I, etal. Plus 

333767 Dunham, I. etal. Pius 

333768 Dunham, I. etal. Plus 

333769 Dunham, I. etal. Plus 

333795 Dunham, I. etal. Plus 

333796 Dunham, I. etal. Plus 
333892 Dunham. I. etal. Plus 
333921 Dunham, 1. etal. Plus 
333968 Dunham, I. etal. Plus 
334102 Dunham, I. etal. Plus 
334264 Dunham, I. etal. Plus 
334343 OunhanUetal Plus 
334794 Dunham, t etal. Plus 
334889 Dunham, I. etal. Plus 
335287 Dunham, I. etal Plus 
335491 Dunham, I. etal. Plus 
335495 Dunham, I. etal. Plus 
335498 Dunham, 1. etal. Plus 
335653 Dunham, I. etat Plus 
335687 Dunham, I. etal. Plus 
335809 Dunham, I. etal Plus 

335822 Dunham, I. etal. Plus 

335823 Dunham. I. etal Plus 

335824 Dunham. I. etal. Plus 

335825 Dunham, I etal Plus 
336035 Dunham, I. etal Plus 
336093 Dunham. I. etal Plus 
338096 Dunham, I. etal Plus 
336444 Dunham, 1. etal. Plus 
336959 Dunham, I. etal Plus 
338008 Dunham, I. etal Plus 
338057 Dunham. 1. etal. Plus 
338410 Dunham, I. etal. Plus 
338588 Dunham, 1. etal Plus 
338665 Dunham, I. etat Plus 
338832 Dunham, I. etal Plus 
338980 Dunham, I. etal Plus 
339352 Dunham, I. etal Plus 



2508896-2508992 

2516164-2516310 

2521424-2521555 

33694954369571 

2521424-2521555 

46303884630645 

49137494913805 

5570729-5570925 

62347784234894 

7068795-7068896 

7694407-7694623 

7695440-7695697 

7696625-7696707 

7807688-7807795 

7808253-7808319 

81568254157001 

8380325-8380441 

8681004-6681241 

9995140-9996373 

13234447-13234544 

13655828-13656307 

16374312-16374458 

19286024-19286515 

22299047-22299299 

24128651-24128827 

24140688-24140872 

24172082-24172161 

25329710-25329802 

25445952-25446064 

26310772-26310909 

26364087-26364196 

26365925-26366004 

26376860-26376942 

26378175-26378268 

29016748-29017410 

29556922-29557002 

29578878-29579047 

3419058544190718 

13233040-13233126 

7697058-7697236 

85263974526522 

19292807-19292916 

22896767-22896920 

24472654-24472853 

27775128-27775290 

29896789-29896874 

3354478443545121 
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332929 Dunham, 1. etaL Minus 

332930 Dunham, 1. eLat Minus 
332983 Dunham, I. etaL Minus 

333009 Dunham, I. etaL Minus 

333010 Dunham,!. etaL Minus 
333013 Dunham, LetaL Minus 
333108 Dunham, LetaL Minus 
333343 Dunham, LetaL Minus 
333456 Dunham, L etaL Minus 
333459 Dunham. I. etaL Minus 
333743 Dunham. 1. etaL Minus 
333758 Dunham, LetaL Minus 

333904 Dunham, LetaL Minus 

333905 Dunham, LetaL Minus 

334222 Dunham, LetaL Minus 

334223 Dunham, LetaL Minus 
334360 Dunham, L etaL Minus 
334784 Dunham, I. eta). Minus 
334789 Dunham, I. etaL Minus 
335004 Dunham, LetaL Minus 
335115 Dunham, LetaL Minus 
335342 Dunham. I. etaL Minus 
335544 Dunham, LetaL Minus 
335610 Dunham, I. etaL Minus 
335682 Dunham. LetaL Minus 
335755 Dunham, LetaL Minus 
335782 Dunham, i. etaL Minus 
335791 Dunham, LetaL Minus 
335895 Dunham, LetaL Minus 
335917 Dunham, L etal. Minus 
335920 Dunham, LetaL Minus 
336042 Dunham, I. etaL Minus 
336150 Dunham, LetaL Minus 
336152 Dunham, LetaL Minus 
336416 Dunham, LetaL Minus 
336449 Dunham, L etaL Minus 
336471 Dunham, LetaL Minus 
336512 Dunham, 1. etal. Minus 
336558 Dunham, LetaL Minus 
336560 Dunham, L etal. Minus 
336676 Dunham, LetaL Minus 
337968 Dunham, L etal. Minus 
338451 Dunham, I. etal. Minus 
338689 Dunham, LetaL Minus 
339373 Dunham, I. etal. Minus 
325622 5867000 Plus 
329655 6448516 Minus 
329899 6563505 Minus 
329960 5091594 Minus 
326213 5867224 Minus 
326474 5867405 Plus 
330084 6015302 Minus 

326816 6552458 Plus 

326817 6552458 Plus 
327110 6117842 Plus 
327196 5867446 Plus 
327283 5867478 Minus 
327313 5867501 Minus 
327450 5867766 Minus 
328059 6117819 Plus 
328492 5868455 Minus 
328304 6004478 Minus 
328857 6381927 Minus 
329367 5868842 Minus 
329373 6682537 Minus 



2020758-2020664 

2022565-2022497 

2631933-2631797 

2766043-2765856 

2766207-2766119 

2772278-2772039 

3240494424G389 

4692886^692753 

2631933-2631797 

5144548-5144344 

7573218-7573060 

7666413-7666091 

8217374-8217261 

82177964217670 

12732417-12732289 

12734365-12734269 

13728850-13728751 

16294548-16294360 

16306095-16305996 

20581911-20581794 

21388250-21388146 

22597448-22597284 

24650505-24650403 

25068943-25068841 

25421215-25421093 

25763806-25763747 

25908578-25908440 

25948563-25948411 

26975307-26975239 

27028481-27028377 

27034927-27034811 

29041694-29041500 

3015042S40150256 

3015605340155870 

34047408-34047311 

3420470744204577 

34215091-34214978 

3427837344278275 

3437532544375698 

3437681444376596 

2022565-2022497 

7095797-7095680 

20174286-20174193 

24893073-24892972 

33860127-33860047 

69994-70075 

35565-35843 

111058-111783 

1031-1162 

60751-60927 

16995-18101 

57019-59337 

198354-198436 

199909-200001 

94608-94785 

180921-181333 

567-962 

8973449838 

4792848076 

37052-37204 

46094-46241 

38844952 

8055741051 

8720147587 

3895049301 
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TABLE 14: Table 2 from BRCA 001-5 US 

Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD; Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to norma) breast tissue 



Pkey ExAccn UnJGenetD Unigene Title 

100038 M97935 AFFX control: STAT 1 

100114 D00596 Hs.B2962 (hymidylate synfoetase 

100975 J02923 Hs.76506 lymphocyte cytosolic protein 1 (L-plastin) 

101031 J05070 Hs.151738 matrix metaltoproteinase 9 (gelafinase B; 9 

101104 L07615 Hs.169266 Human neuropeptide Y receptor Y1 (NPYY 

101143 L12723 Hs.90093 heat shock 70kD protein 4 

101332 L47276 Hs.156346 Homo sapiens (cell line HIS) alpha topois 

101378 M13755 Hs.833 interferon -stimulated protein; 15 kDa 

101809 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 

102618 U65932 Hs.81071 extracellular matrix protein 1 

102721 U79241 Hs. 118666 Human clone 23759 mRNA: partial cds 

102817 U90904 Hs.83724 Human clone 23773 mRNA sequence 

102907 X069S5 Hs.202833 heme oxygenase (decycBng) 1 

102985 X17644 Hs.2707 G1 to S phase transition 1 

103060 X57766 Hs.155324 matrix metaltoproteinase 11 (stromelysin 3 

103180 X69433 Hs.5337 isocitrate dehydrogenase 2 (NADP+); mito 

103206 X72755 Hs.77367 monokine induced by gamma Interferon 

103821 AA157623 Hs.198793 KIAA0750 gene product 

104115 AA428090 Hs.26102 ESTs 

104667 AA007234 Hs.30098 ESTs 

105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564G 

106103 AA421104 Hs.12094 ESTs 

107151 AA621169 Hs.8687 ESTs 

109415 AA227219 Hs.1 10826 trinucleotide repeat containing 9 

110189 H20543 Hs.6278 DKFZP586B1 621 protein 

110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUmN-CON 

110734 H98714 Hs.24131 ESTs 

110915 N46252 Hs.29724 ESTs 

111179 N67239 Hs.10760 ESTs 

111357 N91023 Hs.87128 ESTs 

112134 R46025 Hs.7413 ESTs 

113970 W86748 Hs.8109 ESTs 

114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 protein 

114292 Z40715 Hs.184641 delta-6 fatty acid desaturase 

114901 AA236276 Hs.196437 ESTs; Weakly similar to R26660 1;partial 

114965 AA250737 Hs.72472 ESTs 

115652 AA405098 Hs.38178 ESTs 

115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity t 

116790 H29532 Hs.101174 microtubule-associated protein tau 

116921 H72948 Hs.821 blglycan 

117412 N26722 Hs.42645 ESTs 

120241 Z41815 Hs.65946 ESTs 

120325 AA195651 Hs.104106 ESTs 

121596 AA416740 Hs.174104 ESTs 

123619 AA609200 ESTs 

124006 D60302 Hs.270016 ESTs 

125852 H09290 Hs.76550 Homo sapiens mRNA; cONA DKFZp564B 

126160 N90960 Hs.265398 ESTs; Weakry similar to transformatiorHel 

252 



R1 

16.7 

15.9 

30.1 

37.2 

18.3 

17.4 

18.9 

18.1 

22.5 

23.2 

15 

15.2 

22.7 

20.6 

17.8 

18.9 

15.1 

23.3 

28.7 

16.6 

19.3 

15.4 

19 

20.1 

16.6 

19.5 

30.2 

23.2 

37 

15 

17.4 

15 

22 

19.4 

16.9 

35.1 

16.1 

33.5 

22.2 

20.7 

18.1 

15.6 

15.2 

22.6 

23.1 

20.6 

25.9 

16.4 
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127677 AA916752 Hs.264190 
128595 U31875 Hs.152677 
128717 T30617 Hs.104222 
129124 AA234530 Hs.108802 
5 129366 H18027 Hs.184697 
130455 X17059 Hs.155956 
130604 X03635 Hs.1657 
130913 W03592 Hs.21198 
130944 M97935 Hs.21486 
10 131472 AA608962 Hs.27258 
131562 U90551 Hs.28777 
132180 AA405569 Hs.418 
132406 F09979 Hs.4774 
132465 AA047896 Hs.49169 
15 132994 AA505133 Hs.279905 
133294 R79723 Hs.69997 
133634 U24166 Hs.234279 
134374 D62633 Hs.8236 
134405 J04177 Hs.82772 
20 134470 X54942 Hs.83758 
134495 D63477 Hs.84087 
134714 U89922 Hs.890 
135237 AA454930 Hs.9691 
301884 AA312082 Hs.105445 
25 302276 NMJ04448HS.323910 
302290 AL1 17607 Hs.175563 
309177 AI951118 
309583 AW170035 
310438 AW022192 Hs.200197 
30 311166 AI821294 Hs.118599 
312153 AA759250 Hs.153028 
313915 A1969390 Hs.163443 
314506 AA833655 Hs.206868 
314558 A1873274 Hs.190721 
35 314691 AW207206 Hs.136319 
314943 AI476797 Hs.184572 
315196 AA972756 Hs.44898 
316177 AI908272 Hs.293102 
318073 AW167087 Hs.131562 
40 318662 A1285898 Hs.294014 
318740 NM_002543Hs.77729 
318744 AI793124 Hs.144479 
319668 NMJ)02731Hs.87773 
320074 AA321166 Hs.278233 
45 320211 AL039402 Hs.125783 
320727 U96044 Hs.181125 
322818 AW043782 Hs.293616 
322882 AW248508 Hs.279727 
324261 AL044891 Hs.269350 
50 324432 AA464510 Hs.152812 
324603 AW016378 Hs.292934 
324620 M448021 Hs.94109 
324988 T06997 Hs.121028 
330388 X03363 
5 5 330486 M13755 Hs.833 

330814 AA015730 Hs.265398 
331145 R72427 Hs.129873 
331308 AA252079 Hs.63931 
331890 AA432166 Hs.3577 
60 332526 AA281753 Hs.77515 
332532 N63192 Hs.1892 
332694 AA262768 Hs.243901 
332958 
333769 
65 333968 
334223 
334254 
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I J.J 


EST cluster (not In UniGene) 


50.1 


EST cluster (not In UniGene) 


16.7 


ESTs 


23.1 


EST duster {not in UniGene) 


212 


EST duster (not in UniGene) 


24.5 


HER2 receptor tyrosine kinase (c-erbB-2; E 


17.7 


interferon-stimulated protein; 15 kDa 


67 


ESTs; Weakly similar to transformatiorHel 


44.1 


ESTs; Weakly similar to CYTOCHROME 


41.9 


dachshund (Drosophila) homolog 


15.1 


sucdnate dehydrogenase complex; subunit 


24.3 


Inositol 1;4;5-triphosphate receptor; type 3 


19 


EST; Highly similar to PHENYLETHANO 


15.3 


KIAA1067 protein 


152 


CH22.FGENES.48J5 


17.8 


CH22.FGENES271 8 


483 


CH2^FGENES.307 4 


15.9 


CH2*_FGENES.360_4 


33.5 


CH2^FGENES.367J5 


18.5 
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335791 
336512 
338008 



CH2i.FGENES.611 J 
CH22JGENES.834J 
CH22LEM:AC005500.GENSCAN.127-9 



27.3 
21.4 
15.2 
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TABLE 14A 



Table 14A shows the accession numbers for those pkeys lacking unigenelD's for Table 14. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey CAT number Accession 

309583 1046029^-2 AW170035 
20 336512 CH22_3941FG 834 7 L1NK.DJ 
338008 CH22_6490FG_UMCEM:ACQ0 
333769 CH22_1036FG_271_8 UNK.EM 
333968 CH22_1245FG_307 4 LINK.EM 
335791 CH22_3160FG_611_7_UNK_EM 
25 309177 AI951118 

332958 CH22J82FG_48J5_LINK_EM: 
334223 CH22J507FG 360 4_UNK_EM 
334264 CH22_1551FG_367 15 UNKJE 
123619 371681 J AA602964 AA6O9200 



15 



Pkey: 

CAT number: 
Accession: 



Unique Eos probeset Identifier number 
Gene duster number 
Genbank accession numbers 
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TABLE 14B 

5 Table 14B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

1 0 Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham J. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA Strang' from which exons were predicted. 

NLposifion: Indicates nucleotide positions of predicted exons. 



15 



Pkey Ref Strand NU>osition 



332958 Dunham, I. etal. Plus 2516164-2516310 

20 333769 Dunham, I. etaL Plus 7696625-7696707 

33396B Dunham, I. etal. Plus 8681004-8681241 

334264 Dunham, I. etal. Plus 13234447-13234544 

338008 Dunham, I. etal. Plus 7697068-7697236 

334223 Dunham, I. etaL Minus 12734365-12734269 

25 335791 Dunham, LetaL Minus 25948563-25948411 

336512 Dunham, LetaL Minus 34278373-34278275 
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TABLE 15: Table 3 from BRCA 001-5 US 
Table 15 shows genes downregulated in breast cancer cells. 



Pkey: Unique Eos pnobeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UragenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of normal breast tissue to tumor 

Pkey ExAccn UniGene ID Unigene Title R1 

100115 000632 Hs.172153 glutathione peroxidase 3 (plasma) 1.7 

100499 TIGR:HT1428 Hs.283108 Globin.Beta 15 

100502 TIGR.HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2,3 

100815 TIGR:HT4268 Hs.9739 L^ly^^-Phosphate^lad+Qxidoreduct 1.7 

101125 L10373 Hs.82749 transmembrane 4 superfamiry member 2 1.5 

101367 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha po £9 

101397 M15856 Hs.180878 lipoprotein lipase 1.6 

101883 M9S399 Hs.75613 CD36 antigen (collagen type I receptor; thr 1.6 

102227 U25138 Hs.93841 potassium large conductance caJdunvactiv 1.6 

102857 X00129 Hs.76461 retinol-binding protein 4; intersStial 3 

103211 X73079 Hs.288579 polymeric Immunoglobulin receptor 1.8 

103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5 

103562 Z2mB Hs.2815 POU domain; class 6; transcription factor 1 1.8 

104672 AA007629 glycerol-3-phosphate dehydrogenase 1 (sol 2.4 

105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7 

105138 AA164519 Hs.15248 ESTs 15 

106075 AA417915 Hs.25930 ESTs 1.5 

106870 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6 

107099 AA609645 Hs.211568 eukaryoSc translation initiation factor 4 gam 2.7 

107616 AA004901 Hs.261164 ESTs 1.6 

107997 AA037388 Hs.82223 Human DNA sequence from done 1 41 H5o 1.7 

108604 AA099820 Hs.49696 ESTs 24 

111130 N64265 Hs.19515 yz44h1 2.S1 Morton Fetal Cochlea Homo sa 1.7 

111837 R36447 Hs.24453 ESTs 16 

112538 R70255 ESTs 19 

112808 R97970 Hs.281022 EST 1 5 

113086 T40652 Hs.20910O DKFZP434C171 protein 1.9 

115740 AA4 18033 Hs.283559 ESTs 16 

115949 AA443800 Hs.43125 ESTs 2 

115965 AA446661 Hs.173233 ESTs 22 

117224 N20300 Hs.218707 ESTs 17 

117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 17 

119059 R15436 Hs.77889 Friedreich ataxia region gene X1 23 17 

119175 R71792 Hs.301002 ESTs; WeaWy similar to cell death acGvato 2.8 

119359 T71021 . Hs.285681 ESTs; Highly similar to WS basic-nelbc-loo 1.9 

119798 W73386 Hs.249129 ESTs 3 

120889 AA365784 Hs.97044 ESTs 1.6 

121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-famiiy pro 1.8 

121750 AA421 184 Hs.97549 ESTs 15 

122127 AA434447 Hs.108771 ESTs 25 

122348 AA443695 Hs.293410 ESTs 21 

122485 AA448300 Hs.160318 phospholemman 1.5 

123443 AA598841 Hs.167382 natriuretic peptide receptor A/guanylalecy 1.8 

123505 AA600135 ESTs; Moderately similar to Ull ALU SUB 1.5 

125284 W94688 Hs.103253 perflipn 1J 

126300 D81972 HUM427D08B Human fetal brain (TFuJrw 1.8 

126747 R72515 Hs.160318 phospholemman 1.6 

127218 AA309765 Hs,1 16017 ESTs; Weakly similar to K1AA0795 protei 1.5 

127357 AA452788 Hs75432 2x39g11.r1 Soares totaLfetus_Nb2HF8 9 1.7 
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127638 AA634405 Hs.122608 ESTs 1.5 

128213 AA972780 Hs.129t94 ESTs; Weakly similar to lill ALU SUBFA 1.5 

128351 AI092391 Hs.134886 ESTs 1.5 

128842 N44757 Hs.20340 ESTs 1.6 

5 128870 R71403 Hs.75309 eukaryotic translation elongation factor 2 1.7 

129146 AA459944 Hs.108924 DKFZP586P 1422 protein 15 

129285 T62068 Hs.11Q06 ESTs 2.1 

129331 N93465 Hs.279772 ESTs; Highly similar to CGI-38 protein (H 1.5 

130085 M62402 Hs.2743 13 insuMke growth factor binding protein 6 1.7 

10 130400 M25079 Hs.283 108 hemoglobin; beta 1.7 

131267 AA21 1776 Hs.2504 myomesin 1 (skelemin) (185kD) 3.8 

131277 AA131466 Hs.23767 ESTs 1.9 

131282 M12272 Hs.4 alcohol dehydrogenase 3 (class I); gamma p 2.2 

131304 AA295848 Hs.25475 aquaporin7 1.7 

15 131810 D49487 Hs.194236 leptin (murine obesity homotog) 2.5 

132788 AA045503 Hs.56874 ESTs; Weakly similar to Homo sapiens p2 1.6 

132931 241452 Hs.6090 deleted in bladder cancer diromosome regl 1.5 

133120 X64559 Hs.65424 tetranectin (plasmkiogervtinding protein) 2 

133314 U95367 Hs.70725 gamma-aminobutyric acid (GABA) A recep 1.5 

20 133507 X74295 Hs.74369 integrin; alpha 7 1.7 

133601 S95936 Hs.284176 transferrin 2.3 

133702 N56898 Hs.75652 glutathione S-transferase M5 1.9 

134111 N79674 Hs.8022 TU3A protein 4.6 

134699 U56814 Hs,88646 deoxyribonuclease Mike 3 1.5 

25 134749 L10955 Hs.89485 carbonic anhydrase IV 1.6 

135173 M72885 Hs.95910 Human 60S2 protein gene; complete cds 1.9 

300132 AW027556 Hs.156286 ESTs 1.7 

300732 AI369956 Hs.257891 ESTs 1.5 

300750 AA514805 Hs.293055 ESTs 1.8 

30 301140 A1807692 Hs.129129 ESTs 1.6 

301396 AA923549 Hs.224121 ESTs 2.1 

302910 N77976 Hs.251577 hemoglobin; alpha 1 1.8 

303798 VQ0505 Hs.36977 hemoglobin; delta 1.6 

303831 T04868 Hs.46780 EST cluster (not in UniGene) with exon hit 1.7 

35 303844 U94362 Hs.58589 glycogenin2 1.5 

304182 H91086 EST singleton (not In UniGene) with exon 1.5 

304622 AA516384 EST singleton (not in UniGene) with exon 1.5 

304682 AA550994 EST singleton (not in UniGene) with exon 1.7 

305612 AA782347 Hs.272572 EST singleton (not in UniGene) with exon 1.5 

40 306193 AA923457 EST singleton (not in UniGene) with exon « 

307206 AI192534 EST singleton (not in UniGene) with exon 1.6 

307377 AI222691 EST singleton (not in UniGene) with exon 1.5 

308023 AI452732 Hs.251577 EST singleton (not in UniGene) with exon 1.9 

308359 AI612774 Hs.79372 retinoid X receptor; beta 1.5 

45 309838 AW296073 Hs.255504 EST 1.5 

310403 AI720978 Hs.148006 ESTs; Moderately similar to alternatively s 1.8 

311671 AW241947 Hs.232478 ESTs 1.6 

311794 AW238092 Hs.254759 ESTs 2.1 

312082 T79860 Hs.1 18180 ESTs 1.9 

50 312575 H25237 Hs.306814 ESTs 2.3 

313076 N49684 Hs.143040 ESTs 1.8 

313283 W32480 Hs.157099 ESTs 2.2 

313374 AW328672 Hs.132760 ESTs 1.9 

314701 AI754634 Hs.131987 ESTs 1.7 

5 5 315391 AA759098 Hs.192007 ESTs 1.8 

315688 AA680055 Hs.264885 ESTs 1.5 

316249 AA948612 Hs.130414 ESTs 1.6 

316586 AI205077 Hs.294085 ESTs 1.7 

316890 AA837079 Hs.24647 ESTs 1.5 

60 316983 AI480204 Hs.177131 ESTs 1.5 

317604 AI650625 Hs.300756 ESTs 1,6 

317951 AW206520 Hs.129621 ESTs 1.5 

319400 W26902 Hs.154085 ESTs 1.7 

320757 H22654 Hs.6382 EST cluster (not in UniGene) 1.5 

65 321594 AA021402 Hs.1 1067 ESTs 1.7 

322102 H45589 EST cluster (not in UniGene) 1.5 

322814 AI824495 Hs.211038 ESTs 2.2 
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10 



15 



20 



25 



30 



322929 AI365585 


Hs.146246 ESTs 


2.3 


323831 AA335715 


Hs.200299 ESTs 


1.7 


324044 A1045752 


Hs.22350 ESTs 


1.8 


324675 AW014734 


Hs,157969 ESTs 


2.2 


325272 


CH.11_hsgl|5866902 


1.5 


325558 


CH.12_hsgil6056302 


1.6 


325556 


CR14Jsgi}6056305 


1.6 


326120 


CH.17 hsgj|5S67194 


1.5 


326139 


CH.17_hs #867203 


1.5 


326855 


CH.20Jisgij6552460 


1.5 


327438 


CH.02_hsgi[6004454 


1.6 


329733 


CH.14_p2gi|6065783 


1.6 


330931 F01443 


Hs.284256 ESTs 


4.6 


331591 N71677 


Hs.42146 ESTs 


1.9 


332159 AA621393 


Hs.112984 EST 


1.5 


332364 W94688 


Hs.103253 perifipin 


2.1 


332502 H21819 


Hs. 14896 Homo sapiens clone 24590 mRNA sequenc 


1.5 


334175 


CH2i.FGENES.349 10 


1.5 


334347 


CH22_FGENES.375 31 


1.8 


334737 


CH22.FGENES.424 12 


18 


335352 


CH22J : GENES.539_5 


1.5 


335639 


CH22_FGENES.584J9 


1.6 


336244 


CH22.FGENES.746J2 


1.5 


336336 


CH22_FGENES.814J 


1.7 


336865 


CH^GENEaSOS-l 


1.6 


337494 


CH22.FGENES.799-12 


1.6 


337764 


CH223/i:AO)00097.GENSCAN.1 19-1 


1.8 


337983 


CH22_EMAC005500.GENSCAN.110-1 


2 


338192 


CH22.EM^C005500.GENSCAN.228-1 


1.5 


339366 


CH22_BA354U2.GENSCAN.34-2 


1.5 
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TABLE 15A 



5 

10 
15 
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Table 15A shows the accession numbers for those pkeys lacking unigenelD's for Table 15. 
For each probeset, we have listed the gene duster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accesston'numbers 



Pkey CAT number Accession 

126300 250375J D81972 BE003132 

112538 504579.1 AA908813 R70255 

123505 genbanieAA600135 AA600135 * 

104672 6735,7 AA349096 A136B018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 



AI276281 R48205 AI245302 AI190036 A1281050 AW242903 H42892 M910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 M774528 AA007629 H42537 C01077 F32386 



322102 46708.1 H45589H19807AF075038H19808 H42437 

336865 CH22 4590FG 305 1 

338192 CH22J755FG__UNK_EM:AC00 

329733 c14j>2 

326120 c17 hs 

326139 c17_hs 

326855 c20_hs 

335352 CH22_2699FG_539_5JJNK EM 

335639 CH22_2999FG_584_19_UNK E 

307206 AI192534 

307377 AI222691 

337494 CH22_5727FG_799_1*_ 

337764 CH22_6115FG_UNK_EM:AC00 

337983 CH2^6438FG_UNK.EMAC00 

339366 CH22_8336FG_JJNK^8A354I1 

325272 c11Jis 

325558 c12_hs 

325656 c14_hs 

334175 CH22J455FG_349_10JJN!CE 
304182 H91086 

334347 CH22 1640FG 375_31_UNK E 
327438 cjjhs 
304622 AA516384 

334737 CH2£_2049FG_424J2JJNK_E 
304682 AA550994 

336244 CH22_3642FG_746JLUNKJ)A 
306193 AA923457 

336336 CH22_3746FG_814JJJNK_BA 
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TABLE 15B 

Table 15B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



30 



35 



Pkey: Unique number corresponding to an Eos probeset 

Refc Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham !. et at" refers to the publication 

entitled The DNA sequence of human chromosome 22.* Dunham I. et al., Nature (1999) 402:489-495. 
Slrand: indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand NtjJosiu"on 



334347 
334737 
335639 
337494 
334175 
335352 
336244 
336336 
336865 
337764 
337983 
338192 
339366 
325272 
325558 
325656 
329733 
326120 
326139 
326855 
327438 



Dunham, i. etal. 
Dunham, I. eLa!. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, 1. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I, etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
5866902 
6056302 
6056305 
6065783 
5867194 
5867203 
6552460 
6004454 



Plus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Plus 

Minus 

Minus 

Minus 



13663814-13663926 

15998517-15998685 

25173591-25173696 

33339024-33339148 

11668659-11668597 

22681512-22681384 

31402729-31402583 

33797209-33797076 

8622405-8622289 

40356404035446 

7275495-7275271 - 

13248453-13248277 

33647431-33647293 

13247-13312 

70930-71030 

78190-78707 

163237-163450 

36116-36276 

218901-218960 

111390-111463 

199569-199692 
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TABLE 16: Table 4 from BRCA 001-5 US 

Table 16, a subset of table 15, depicts a preferred group of genes highly downregulated in 
5 breast cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: . Exemplar Accession number, Genbank accession number 
10 UnigenelD: Unigene number 
Unigene Title: Unigene gene title 



R1: 



Ratio of normal breast tissue to tumor 



1 5 Pkey ExAccn 



20 



25 



30 



35 



40 



45 



100502 
101367 
102857 
104672 
107099 
108604 
115949 
115965 
119175 
119798 
122127 
122348 
129285 
131267 
131282 
131610 
133120 
133601 
134111 
301396 
311794 
312575 
313283 
322814 
322929 
324675 
330931 
332364 
337983 



T1GR:HT1496 

M12963 

X00129 

AA007629 

AA609645 

AA099820 

AA443800 

AA446661 

R71792 

W73386 

AA434447 

AA443695 

T62068 

AA211776 

M12272 

D49487 

X64559 

S95936 

N79674 

AA923549 

AW238092 

H25237 

W32480 

AI824495 

AI365585 

AW014734 

F01443 

W94688 



UniGene (D Unigene Title R1 

Hs. 169228 Adrenal-Specific Protein Pg2 2.3 

Hs.4 alcohol dehydrogenase 1 (class I); alpha 2.9 

Hs76461 retinol-binding protein 4; interstitial 3 

glycerol-3-phosphate dehydrogenase 1 2.4 

Ks.21 1 566 eukaryotic translation initiation factor 4 gam 2.7 

Hs.49696 ESTs " 2.4 

Hs.43125 ESTs 2 

Hs 173233 ESTs 2 2 

Hs.301002 ESTs; Weakly similar to cell death activator 2A 

Hs.249129 ESTs 3 

Hs.106771 ESTs 2.5 

Hs.293410 ESTs 2.1 

Hs.11006 ESTs 2.1 

Hs.2504 myomesin 1 (skelemin) (185kD) 3.8 

Hs.4 alcohol dehydrogenase 3 (class I); gamma 2.2 

Hs. 1 94236 leptin (murine obesity homolog) 2.5 

Hs.65424 tetranectin (plasmlnogen-binding protein} 2 

Hs.284176 transferrin 2.3 

Hs.8022 TU3A protein 4.6 

Hs.224121 ESTs 2.1 

Hs.254759 ESTs * 2.1 

Hs.306814 ESTs 2.3 

Hs.1 57099 ESTs 2.2 

Hs.211038 ESTs 2.2 

Hs.146246 ESTs 2.3 

Hs.157969 ESTs 2.2 

Hs.284256 ESTs 4.6 

Hs.103253 perilipln 2.1 

CH22^EM:AC005500.GENSCAN.110-1 2 
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TABLE 16A 

Table 16A shows the accession numbers for those pkeys lacking unigenelD's for Table 16. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

104672 6735_7 AA349096 A1358018 F21390 F17759 R48772 AI421485 AI300352 H43971 A1378525 F33652 R47898 AI264177 F22289 N28263 
AI276281 R48205 AI245302 A1190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 AA774528 AAQQ7629 H42537 C01077 F32386 
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TABLE 17: Table 1 from BRCA 014 P 

Table 17 shows accession numbers representing 759 sequences of breast cancer genes or 
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene 
identified, a ratio of the relative amount of expression in breast tumors versus normal breast 
tissue. 



10 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigene number 



15 R1; 



Unigene Title: Unigene gene title 



Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD Unigene Title 



R1 



20 



25 



30 



35 



40 



45 



50 



55 



60 



100227 AV654694 Hs.82316 interferon-induced, hepatitis C-assodat 3 

100405 AW291587 Hs.82733 nldogen2 3.2 

100406 AI962060 Hs.1 18397 AE-binding protein 1 3 6 
100420 086983 Hs.116893 Melanoma associated gene 3.2 
100911 X83300 Hs.289103 SMA4 " 5.2 
100960 J00124 Hs.117729 keratin 14 {epidermolysis bullosa simple 4.3 
101011 BE387036 Hs.1211 acid phosphatase 5, tartrate resistant 3 
101183 AA442324 Hs.795 H2Ahistone family, member 0 3.2 
101194 L20971 Hs.188 phosphodiesterase 4B, cAMP-specific (dun 3 
101329 U66042 Hs.82171 Homo sapiens done 1 91 B7 placenta expres 4.1 
101378 8E563085 Hs.833 interferon-stimulaled protein, 15 kDa 5.3 
101474 R07566 Hs.73817 small Inducible cytokine A3 (homologous 3.9 
101491 M25809 Hs.64173 ATPase. H+ transporting, lysosomal (vacu 4.5 
101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9 
101602 AA353776 Hs.901 C048 antigen (B-cell membrane protein) 3.4 
101663 NMJW3528 Hs.2178 H2B histone family, member Q 5.6 
101758 BE019494 Hs.79217 pyrroHne-5-carboxylate reductase 1 3.6 
101767 M81057 Hs. 180884 carboxypeptidaseBI (dssue) 12 
101817 M89907 Hs.152292 SWl/SNF related, matrix associated, acti 3.2 
101851 BE260964 Hs.82045 midkine (neurite growth-promoting factor 4.1 
101878 M97815 Hs. 1 83650 cellular retinoic acid-binding protein 2 6.5 
102209 NH.002038 Hs.265827 Interferon, alpha-inducibie protein (do 3 
102214 U23752 Hs.32964 SRY (sex determining region Y)-box 11 3 
102297 NMJ01504 Hs.198252 G protein-coupled receptor 9 3.7 
102299 NMJ505824 Hs.155545 37 kDa leudne-rich repeat (LRR) protein 3.7 
102301 NMJJ05651 Hs.183671 tryptophan 2,3-dioxygenase 5.2 
102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homotog) 3.5 
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 3.9 
102591 U62325 Hs.324 125 amyloid beta (A4) precursor protein-bind 4 
102721 H16646 Hs.1 18666 hypothetical protein PP591 3.5 
102739 AA363025 Hs.155572 Human clone 23801 mRNA sequence 3.2 
102791 AF080229 gb:Human endogenous retrovirus K done 1 3 
102804 NMJW2318 Hs.83354 lysyl oxidase-like 2 3.2 
102903 M73779 Hs.250505 retinoic acid receptor, alpha 3.3 
103010 X52509 Hs.1 61 640 tyrosine aminotransferase 124 
103042 T81656 Hs.252259 ribosomal protein S3 4.5 
103117 X63578 Hs.295449 parvalbumin 3 
103207 X72790 gb:Human endogenous retrovirus mRNA for 5.9 
103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.9 
103284 AI751601 Hs.8375 TNF receptor-associated factor 4 3 3 
103329 X85134 Hs.72984 retinobtastoma-blnding protein 5 31 
103364 X90872 Hs.279929 gp25L2 protein 3 
103385 NM.007069 Hs.37189 similar to rat HREV107 34 
103456 AA495425 Hs.9629 papillary renal cell carcinoma (transloc 3.2 
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103498 
103558 
103563 
103612 
103825 
104073 
104103 
104115 
.104168 
104173 
104181 
104189 



104307 
104518 
104556 
104658 
104748 
104755 
104825 
104830 
104865 
104906 
104961 
105038 
105088 
105092 
105093 
105304 
105397 
105409 
105431 
105552 
105598 
105650 
105688 
105808 
105809 
105909 
105965 
106135 
106184 
106293 
106400 
106474 
106484 
106533 
106614 
106636 
106661 
106743 
106844 
106864 
106865 
106871 
106942 



107105 
107158 
107248 
107265 
107630 
107710 
107890 
107985 
108000 
108217 



Y09306 
BE616547 
L02911 
BE336654 
AI571835 
AW779318 
AW021102 
AF183810 
M461618 
AA084273 
AF173296 
AB040927 
AI559444 
AI929700 
H20816 
AV650851 
AA360954 
AA015879 
T49951 
AA035613 
AW294092 
T79340 
BE298684 
H78517 
AW503733 
H58589 
AA148982 
AL137566 
AW134924 
AA814807 
AW505076 
AA252033 
AA256750 
AA279439 
W16741 
AI299139 
AI133161 
AW973653 
AA195191 
AA131657 
AL1 17474 
W28948 
N39842 
BE397649 
BE383668 
AA351978 
AL134708 
AA648459 
AW958037 
AW499914 
BE613328 
M465055 
AJ311928 
AW192535 
AW472981 
AA995351 
AF216751 
AW963419 
N32849 
AW263124 
BE379594 
AW961576 
A1955040 
M025386 
T40064 
A1263307 
AA058686 



Hs.30148 
Hs.2785 
Hs.150402 
Hs.70937 
Hs.55468 
Hs.88417 
Hs.21509 
Hs.26102 
Hs.31704 
Hs.76561 
Hs.283740 
Hs.301804 
Hs.293960 
Hs.1 11680 
Hs.112423 
Hs.96900 
Hs.27268 
Hs.33536 
Hs.9029 
Hs.141883 
Hs.21594 
Hs.22575 
Hs:26802 
Hs.33905 
Hs.9414 
Hs.35156 
Hs.29068 
Hs.32405 
Hs.190325 
Hs7395 
Hs.301855 
Hs.242413 
Hs.28802 
Hs.279763 
Hs.25635 
Hs.17517 
Hs.286131 
Hs.20104 
Hs.5111 
Hs.23830 
Hs.41181 
Hs.10762 
Hs.301444 
Hs.94109 
Hs.42484 
Hs.4943 
Hs.145998 
Hs.335951 
Hs.286 
Hs7579 
Hs.21938 
Hs.158213 

Hs.19479 
Hs.321130 
Hs.31314 
Hs.26813 
Hs.155223 
Hs.31844 
Hs.315111 
Hs.49135 
Hs.60178 
Hs.265398 
Hs.61311 
Hs.71968 
Hs.239884 
Hs.62588 



homeodornain-lnteracting protein kinase 3 
keratin 17 

Activln A receptor, type 1 (ACVR1) (ALK 
H3histone family, member A 
ESTs 
ESTs 



ESTs 

opposite strand to trichorhinophaiangeai 
ESTs, Weakly similar to K1AA0227 [H.sapi 
ESTs, Weakly similar to S47072 finger pr 
DC6 protein 
KIAA1494 protein 
ESTs 

endosulfme alpha 

Homo sapiens mRNA; cDNA DKFZp586l1420 {f 
hypothetical protein; KIAA1830 protein 
Homo sapiens cDNA: FU21933 fis, done H 
ESTs 

OKFZP434G032 protein 
ESTs 

hypothetical protein MGC15754 
B-ceD CUL/lymphoma 6, member B (zinc fi 
protein kinase domains containing protei 
ESTs 

K1AA1488 protein 

Homo sapiens cDNA RJ1 1027 fis, clone PL 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586G0321 (f 
ESTs 

hypothetical protein FU23182 
DiGeorge syndrome critical region gene 8 
hypothetical protein DKFZp434K1421 
centaurirvalpha 2 protein 
hypothetical protein FU10504 
HSPC003 protein 
ESTs 

CGI-101 protein 
hypothetical protein FU00052 
hypothetical protein RJ20729 
ESTs 

Homo sapiens mRNA; cDNA DKFZp727C191 (fr 
ESTs 
KIAA1673 

Homo sapiens cDNA FU13634 fis, clone PL 
hypothetical protein aJ10618 



ESTs 

hypothetical protein AF301 222 
ribosomal protein L4 
hypothetical protein FU104Q2 
hypothetical protein RJ12492 
sperm associated antigen 6 
gb:qo89h04j(1 NCLCGAPJ<id5 Homo sapiens 
ESTs 

hypothetical protein MGC2771 



CDA14 

stanniocalcin 2 

hypothetical protein FU12586 

nuclear receptor co-repressor/HDAC3 comp 

ESTs, Moderately similar to ALU7_HUMAN A 

ESTs 

ESTs, Weakly similar to transformation-r 
ESTs, Weakly similar to S10590 cysteine 
Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
H2B histone family, member L 
ESTs 
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108435 T82427 
108591 AB033073 
108733 AA121022 
108771 AF068290 
108819 AA011449 
108912 AA136674 
109086 AF186114 
109124 AK000684 
109132 AI970536 
109163 N23235 
109277 AA196443 
109410 AW504732 
109454 AA232255 
109514 AA234087 
109581 R45584 
109632 AA325138 
109644 AW973964 
109700 F09609 
109768 F06838 
109807 R43646 
109842 AW818436 
109895 AK001680 
110024 AW973152 
110561 AA379597 
110675 H89355 
110707 AI239832 
110915 BE092285 
111139 N64683 
111155 N66563 
111199 AI767435 
111336 AI457338 
111510 R07856 
111532 R08440 
111689 AA602004 
111823 R35252 
111876 R38239 

111892 AA421081 

111893 AF070526 
112125 AW379029 
112170 BE246743 
112287 AB033064 
112300 H24334 
112303 R54797 
112478 R66067 
112561 AI791493 
112631 R82040 
112637 R82331 
112657 AW844878 
112678 AI418466 
112917 AA082465 
113070 AB032977 
113095 AA828380 
113117 AW813731 
113187 BE613410 
113200 T57773 
113206 BE262470 
113374 T79925 
113440 U54727 
113494 T91451 
113518 AW367788 
113571 AI702609 
113822 NMJJ04585 
113835 A1912410 
113938 W81598 
113947 W84768 
113970 W27249 
114086 AA378776 



Hs.194101 
Hs.43857 

Hs.79741 

Hs.271627 

Hs.118681 

Hs.270737 

Hs.183887 

Hs.16603 

Hs.30567 

Hs.86043 

Hs.21275 

Hs.295232 

Hs.262346 

Hs.23025 

Hs.235873 

Hs.291531 

Hs.14763 

Hs.12422 

Hs.23590 

Hs.30488 

Hs.31050 

Hs.5199 

Hs.249159 

Hs.15617 

Hs.29724 

Hs.290943 

Hs.191358 

Hs.29822 

Hs.29894 

Hs.16355 

Hs.23260 

Hs.24944 

Hs.293246 

Hs.12388 

Hs.13429 

Hs.1 18338 

Hs.288529 

Hs.334806 

Hs.26125 

Hs.28664 
Hs.129873 

Hs.164599 

Hs.19769 

Hs.33665 

Hs.125031 

Hs.6298 

Hs.126733 

Hs.159153 

Hs.31575 

Hs.10263 

Hs.241471 

Hs.269165 

Hs.191445 

Hs.86538 

Hs.323954 

Hs.15713 

Hs.17466 

Hs.27475 



Hs.8109 
Hs.288649 



Homo sapiens cONA: FU20869 fis, clone A 3 

similar to glucosamine~6-sulfatases 3.3 

gb:zn84f10.rl Slratagene lung carcinoma 3.9 

hypolhetical protein FU101 16 6.1 

ESTs 3.6 

EST 3.9 

tumor necrosis factor (iigand) superfaml 3.7 

hypothetical protein FU22104 3.1 

hypothetical protein FU13163 3.7 

ESTs, Weakly similar to B34087 hypotheti 4.5 

Homo sapiens cDNA FU13558 lis, done PL 3.7 

hypothetical protein FU1 101 1 4.6 

ESTs, Moderately similar to A46010 X-lin 6.4 

ESTs, Weakly similar to S72482 hypotheti 4.8 
ESTs, Weakly similar to ALU5.HUMAN ALU S 3.3 

hypothetical protein FU22672 3 

ESTs, Highly similar to 120321 7A dehydro 3 

gb:HSC33H092 normalized infant brain cON 3.2 

ESTs 3.2 

ESTs 3.8 

solute carrier family 1 6 (monocarboxyiic 3.3 

DKFZP434F091 protein 3.6 

ESTs .4.2 

HSPC150 protein similar to ubiquifin-con 5.1 

adrenergic, alpha-2A- t receptor 5.3 

ESTs, Weakly similar to ALU4_HUMAN ALU S 3.7 

hypothetical protein FU13187 3.7 

ESTs 4 

ESTs 3.1 

ESTs 4.5 

ESTs 5.4 

ESTs 3.2 

gb:yf19f09.s1 Soares fetal liver spleen 3.1 

ESTs 3.2 

ESTs. Weakly similar to 2109260A 8 cell 3,3 

ESTs. Weakly similar to putative p150 [H 3.1 

ESTs 3.4 

Homo sapiens done 24787 mRNA sequence 3.3 

ESTs, Weakly similar to unnamed protein 4.4 

hypothetical protein FU22635 7.3 

KIAA1238 protein 3.2 

ESTs 4.4 

gb:yg87b07.s1 Soares infant brain 1NIB H 3.4 

ESTs 8.2 

ESTs, Weakly similar to A36036 cytochrom 5.5 

gb:yj06b06.s1 Soares placenta Nb2HP Homo 3.9 

ESTs 5.4 

hypothetical protein MGC4174 3.2 

ESTs 4.7 

choiine/ethanolamlnephosphotransferase 3.7 

KIAA1 151 protein 3.1 

ESTs 3.4 

ESTs, Moderately similar to S65657 alpha 3.4 

SEC63, endoplasmic reticulum translocon 3.2 

ESTs 3.5 

RNB6 6.2 

ESTs, Weakly similar to ALU1_HUMAN ALU S 3.7 

ESTs 3 

ESTs 3.4 

postmeiotic segregation increased 2-like 3.1 

hypothetical protein MGC2776 3.1 

retinoic add receptor responder (tazaro 3.9 

Homo sapiens cDNA FU12749 fis, clone HI 3 

gb:zd88g02.s1 Soares_fetaLhearLNbHH19W 4.6 

gb:zh53d03.s1 SoaresJetaUveLspleeru 3.1 

hypothetical protein FU21080 6.9 

hypothetical protein MGC3077 4.3 
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114148 AW470411 Hs.288433 neuroirimin 4.1 

114424 AW780192 Hs.267596 ESTs 3.4 

114518 AW163267 Hs.106469 suppressor of varl (S.cerevisiae) 3te 3.1 

114563 A1979168 Hs.82226 glycoprotein (transmembrane) nmb 4.8 

5 114965 AI733B81 Hs.72472 BMP-R1B 

114995 AA769266 Hs.193657 ESTs 3.6 

115121 AI634549 Hs.88155 ESTs 3.2 

115134 AW968073 Hs.194331 ESTs, Highly similar to A55713 Inositol 4.2 

115167 AA749209 Hs.43728 hypothetical protein 3 

10 115253 BE149845 Hs.289038 hypothetical protein MGC4126 3.6 

115277 AA814100 Hs.86693 ESTs 3* 

115327 N46436 Hs.109221 ESTs 3.4 

115354 AA281636 Hs.334827 ESTs Jf 

115657 AA405620 Hs.55158 ESTs, Weakly similar to T29520 hypotheti 3.5 

1 5 1 15676 AA953006 Hs.88143 ESTs W 

115709 AW293849 Hs.58279 ESTs, Weakly similar to ALU7JHUMAN ALU S 3.4 

115729 AA417812 Hs.38775 ESTs * 

115787 AI126772 Hs.40479 ESTs 3.1 

115830 AW970529 Hs.86434 hypothetical protein FU21 816 3.6 

20 115835 AA521410 Hs.41371 ESTs 3.1 

115850 NM.014937 Hs.52463 K1AA0966 protein 3 

115900 AK001500 Hs.165186 hypothetical protein FU13852 3.2 

115935 AA354549 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3 

115948 AL042465 Hs.43445 poly{A)-specif>c ribonuclease (deadenyta 3.1 

25 116092 AB041035 Hs.93847 NM_016931:Homo sapiens NADPH oxidase 4 ( 67 

116115 AL042355 Hs.70202 WO repeat domain 10 3.6 

116184 AW450737 Hs.128791 CGI-09 protein 3.1 

116192 AA464976 Hs.62528 ESTs. Moderately similar to A45010 X-Un 3.3 

116208 AI219083 Hs.42532 ESTs. Moderately similar to ALU8_HUMAN A 3.2 

30 116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 3.6 

116443 AW962196 Hs.321264 LBP protein 32 4.1 

116470 AI272141 Hs.83484 SRY {sex determining region Y>box 4 4.1 

116726 AK001114 Hs.53913 hypothetical protein FU 10252 8.6 

116845 AA649530 gb:ns44f05.s1 NCLCGAPJUvl Homo sapiens 3.2 

35 117026 H88256 Hs.50456 ESTs, Moderately similar to ZN75JHUMAN Z 3.5 

117216 AI569804 Hs,42792 ESTs, Weakly similar to I78885 serine/th 3.1 

117296 AL133427 Hs.42506 Homo sapiens mRNA full length insert cDN 3.2 

117403 H84455 Hs.40639 ESTs 4.7 

117691 AB040959 Hs.93836 DKFZP434N0 14 protein 3 

40 118229 AW968941 Hs. 166254 hypothetical protein DKFZp566l1 33 3.3 

118363 AI183838 Hs.48938 hypothetical protein FU21 802 4.3 

118416 N66028 Hs.49105 FKBP-associated protein 3.1 

118470 AW970584 Hs.291033 ESTs 3.4 

118502 AL157488 Hs.50150 Homo sapiens mRNA: cDNA DKFZp564B182 (fr 52 

45 118695 AK000465 Hs.50081 KIAA1 199 protein 3.4 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8.HUMAN A 3.3 
119025 BE003760 Hs.55209 H()mosapier«mRNA;cDNADKFZp434K0514(f 19.7 

119036 R95872 Hs.1 17572 chemokine binding protein 2 3.7 

119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1 JlUMAN A 4.1 

50 119075 M10905 Hs.287820 fibronectin 1 3-2 

119620 W47620 Hs.56009 Z^oligoadenylate synthetase 3 (100 k 3.3 

119741 AF041853 Hs.43670 kinesin family member 3A ' 3.1 

119747 AI970797 Hs.64859 ESTs J 

119754 AL037824 Hs.194695 ras homolog gene family, member I 3.8 

55 119905 AW449064 Hs.119571 coBagen, type III, alpha 1 piers-Danl 3.1 

120084 W94472 Hs.59529 ESTs, Moderately similar to ALU1_HUMAN A 8.4 

120241 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 3.6 

120326 AA195300 Hs.21145 hypothetical protein RG083M05.2 3.2 

120742 AA225084 gb.nc21d06.r1 NCLCGAP.Prl Homo sapiens 3.6 

60 120870 AA357172 Hs292581 ESTs, Moderately similar to ALU1_HUMAN A 5.8 

120885 AA365515 Hs.301872 hypothetical protein MGC4840 3 

120970 AA398118 Hs.97579 ESTs, Weakty similar to A46010 X4lnked 3.7 

121054 AW976570 Hs.97387 ESTs 5-3 

121095 AA320134 Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 4 

65 121103 AA398936 Hs.97697 EST 3.5 

121121 AA399371 Hs.189095 simBar to SALL1 (sal (DrosopMaHike 6.3 

121337 AW885727 Hs.301570 ESTs 4.7 
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121351 AW206227 Hs.287727 hypothetical protein RJ23132 5 

121611 M31669 Hs.1735 inhibin, beta B (activin AB beta polypep 3.6 

121643 AA640987 Hs.193767 ESTs 5.6 

121770 NM.015902 Hs.278428 progestin induced protein 3.4 

5 122125 AK000492 Hs.98806 hypothetical protein 4.1 

122338 AA443311 Hs.98998 ESTs 3 

122417 AA446965 Hs.112092 ESTs 47 

122513 AI767879 Hs.99214 ESTs 3.8 

122544 AW973253 Hs.292689 ESTs 3 

10 122655 AA323296 Hs.97837 Homo sapiens mRNA; cDNA OKFZp547J047 (fr 5.6 

122805 AA526911 Hs.82772 collagen, type XI, alpha 1 3.2 

122851 AW205931 Hs.99598 hypotheUcal protein MGC5338 8.6 

123105 AA487809 Hs.166011 catenin (cadherin-associated protein), d 3 

123111 AA228776 Hs.191721 ESTs 6.9 

15 123249 AA371307 Hs.125056 ESTs ( 3.6 

123273 AA491253 Hs.173611 Empirically selected from AFFX single pr 7 

123385 BE149685 Hs.17767 KiAA 1554 protein 3.1 

123419 T66087 Hs.1 12482 Homo sapiens unknown mRNA sequence 3.4 

123485 AI308876 Hs.103849 hypothetical protein DKFZp761D1 12 3.1 

20 123645 AI675944 Hs.188691 Homo sapiens cDNA FU12033 fis, clone HE 3.8 

123819 AA580082 Hs.112264 ESTs 4.7 

124012 AA352723 Hs.241471 RNB6 3.8 

124243 H69125 Hs.133525 ESTs 4.1 

124357 N22401 gb:yw37g07.s1 Morton Fetal Cochlea Homo 4.1 

25 124359 N22508 Hs. 13931 5 Homo sapiens cONA: FU21479 fis, done C 3.6 

124567 AW451645 Hs.151504 Homo sapiens cDNA FU1 1973 fis, done HE 3.1 

124911 N34151 Hs.174195 interferon induced transmembrane protein 3.5 

124972 R41396 Hs.101774 hypothetical protein FU23045 4.3 

125006 BE065136 Hs.1 45696 splicing factor (CC1. 3) 6 

3 0 125042 T78906 Hs.269432 ESTs, Moderately similar to ALU1_HUMAN A 8.1 

125184 W60326 Hs.288684 Homo sapiens cDNA FU1 1750 fis, done HE 4.7 

125243 AW970536 Hs.105413 ESTs 3.1 

125286 AF086534 Hs.187561 ESTs, Moderately similar to ALU LHUMAN A 3.3 

125304 AL359573 Hs.124940 GTP-binding protein 3 

35 125330 AW880562 Hs.114574 ESTs 3 

125331 AI422996 Hs.161378 ESTs 3.2 

125685 AI924630 Hs.4943 hepatocellular carcinoma associated prot 3.2 

126257 N99638 gb:za39g11.r1 Soares fetal liver spieen 4 

126474 AW975814 Hs.326714 Homo sapiens done IMAGE:71 3177, mRNA se 4 

40 126666 AA648886 Hs.151999 ESTs 3.8 

126872 AW450979 gb:UI-H-BI3-aia-a-12-0-Ul.s1 NCI_CGAP_Su 3 

127431 AW771958 Hs.175437 ESTs, Moderately similar to PC4259 ferri 3.6 

127980 AA961459 Hs.125644 ESTs 4.1 

127997 AW068311 Hs.311054 Homo sapiens mRNA full length insert cDN 3.3 

45 128420 AA650274 Hs.41296 fibranectin teudne rich transmembrane p 4.6 

128609 NM_003616 Hs.102456 survival of motor neuron protein interne 3.9 

128946 Y13153 Hs. 107318 kynurenine 3-monoaxygenase (kynurenine 3 3.1 

128955 AA775076 Hs.1 85807 Homo sapiens, Similar to PRO0478 protein 3.9 

129092 056365 Hs.63525 poly(rC)-binding protein 2 3.3 

50 129270 AA357185 Hs.109918 ras homolog gene family, member H 3.1 

129301 AF182277 Hs.330780 cytochrome P450, subfamily HB (phenobar 3.9 

129385 AA172106 Hs.1 10950 Rag C protein 6,2 

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.4 

129629 AK000398 Hs.1 1747 hypothetical protein FU20391 3 

55 129725 X56411 Hs.1219 alcohol dehydrogenase 4 (dass II), pi p 3.2 

130069 A1754813 Hs.1 46428 collagen, type V, alpha 1 5.4 

130092 X03363 Hs.323910 Vnerb-b2 avian erythroblastic leukemia v 4.4 

130298 AI347487 H$,132781 dass I cytokine receptor 4.6 

130382 NMJJ03450 Hs.1 55204 zinc finger protein 174 5.6 

60 130622 A1582291 Hs.16846 ESTs, WeaWy similar to 04HUD1 debrisoqu 3 

130703 R77776 Hs.1B103 ESTs 3.8 

130881 AA809875 Hs.25933 ESTs 4.2 

130954 AB014544 Hs.21572 KIAA0644 gene product 4.7 

131095 AI399653 Hs.22917 ESTs 4.3 

65 131153 H0904B Hs.23606 ESTs 3.8 

131253 R71802 Hs.24853 ESTs 3.5 

131372 AW293399 Hs.144904 nudear receptor co-represser 1 3.6 
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131507 A1826268 
131587 AI695549 
131739 AF017986 
131795 BE501849 
131970 D86960 
131986 NM.002314 
132093 AA400091 
132122 AA426202 
132159 076435 
132333 AA192669 
132406 AL133731 
132482 AV660345 
132543 BE568452 
132624 AA326108 
132700 AA319233 
132725 NMJJ06276 
132799 W73311 
132847 T48195 
132857 Y00272 
132936 AL120659 
133130 AI128606 
133142 AW952412 
133167 AW162840 
133225 AW600291 

133274 AA085191 

133275 293241 
133287 AW797437 
133376 BE618768 
133462 AW675064 
133740 AW162919 
133831 BE274552 
133976 AI908165 
134666 BE391929 
134710 AI433797 
134731 D89377 
134776 J05582 
135230 AFQ64m 
135303 R61253 
135400 X78592 
135411 L10333 
300089 AJ199738 
300233 AW614220 
300254 AW183618 
300256 AW591433 
300378 Z45270 
300973 AA572949 
301111 R10799 
301341 AA887801 
301548 AI091631 
301884 AA312082 
301935 U79745 
301976 T97905 
302001 AB020711 
302067 BE542706 
302094 AW749321 
302099 AL049670 
302145 NMJJ03613 
302235 AL049987 
302276 AW057736 
302290 AA179949 
302372 AL1 17406 
302378 AL109712 

302384 AJ678059 

302385 AJ224172 
302680 AW192334 
302830 AI038997 
302857 AF282265 



Hs.27769 ESTs, Weakly similar to MCAT_HUMAN MITOC 3.2 

Hs.183868 glucuronidase, beta 3.1 

Hs.31386 secreted frizzled-related protein 2 Z.2 

Hs.32317 highHnobility group 20B 3.2 

Hs.3610 K1AA0205 gene product 3.6 

Hs.36566 UM domain kinase 1 3 2 

Hs.39421 ESTs 12 

Hs.40403 Cbp/r^OO^teradngtransacfivator.wit 3 

Hs.41154 Zc famirymerr^1(cJdi>airedDri^»hi 3.2 

Hs.45032 ESTs 3.5 
Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 9.2 

Hs.238126 CGi-49 protein 8.2 

Hs.5101 protein regulator of cytokinesis 1 3.4 

Hs.33829 bHLH protein DEC2 32 

Hs.5521 ESTs 4.3 

Hs.1 84167 sprK^far^r,arghir^serirte-rich7 3.6 

Hs.169407 SAC2 (suppressor of acfin mutations 2, y 3.2 

Hs.58189 eukaryotic translation inifiation factor 3.5 

Hs.184572 cell division cycle 2, G1 to S and G2 to 4,4 

Hs.61 1 1 aryl-hydrocarbon receptor nudear transl 4.8 

Hs.6557 zinc finger protein 161 3.3 

Hs.65874 ESTs, Weakly similar to A40348 Elav/Sex- 3.5 

Hs.6641 kinesin family member 5C 4.5 

Hs.6823 hypothetical protein FU10430 3 3 

Hs.6949 hypothetical protein MGC11275 3 

Hs.239934 CGI-96 protein 4.5 

Hs.69771 B-factor, properdin 4.1 

Hs.7232 acetyl-Coenzyme A carboxylase alpha 5.1 

Hs.73875 furnarylacetoacetate hydrolase (fumarylac 3 

Hs.170160 RAB2, member RAS oncogene famfly-iike 3.4 

Hs,76578 protein inhibitor of activated STAB 3.9 

Hs.169946 GATA-binding protein 3 (T-ceii receptor 6.2 

Hs.8752 transmembrane protein 4 3.1 

serine hydroxymethyitransferase 1 (scrub 3 

msh (Drosophila) homeo box homolog 2 5.8 

mucin 1 , transmembrane 4 

suppressor of Ty (S.cerevisiae) 3 homoto 3.2 

KJAA1677 protein 3.3 

androgen receptor (dihydrotestosterone r 4.8 

reticulon 1 3.8 

ESTs, Weakly simDar to ALUAJWMAN Hi! 3* 

ESTs 4.2 

solute carrier family 30 (zinc transport 9.9 

Transmembrane protease, serine 3 4.9 

hypothefical protein RJ22672 3.4 

ESTs 3.5 

ESTs 3.8 

G protein-coupled receptor 13.9 

two pore potassium channel KT3.3 4 .4 

GDNF family receptor alpha 1 5.7 

solute carrier family 16 (monocarboxylic 8.6 

gb.7e54d O.ii Scares fetal liver spleen 3.9 

K1AA0904 protein 7.7 

CEGP1 protein 7.3 

ESTs 3.3 

ribosomal protein L34 pseudogene 1 4.2 

cartilage intermediate layer protein, nu 7.9 
Homo sapiens mRNA; cDNA DKFZp564F112 (fr 5.6 

HER2 receptor tyrosine kinase (c-erb-b2, 5.4 
Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1 

ATP-binding cassette transporter MRP8 6 J 

Homo sapiens mRNA fall length Insert cON 4 

synaptonemal complex protein 2 4.3 

lipophilin B (uteroglobin family member) 13.8 

ESTs 9 6 

ESTs 5 

inner centromere protein antigens (135kD 3.4 



Hs.89404 
Hs.89603 
Hs.96757 
Hs.98265 
Hs.99915 
Hs.99947 
Hs.208275 
Hs.189402 
Hs.55610 
Hs.298241 
Hs.235873 
Hs.207566 
Hs.191990 
Hs.208229 
Hs.203845 
Hs.105445 
Hs.1 14924 

Hs.278346 
Hs.222399 
Hs.6786 
Hs.137576 
Hs.151407 
Hs.166361 
Hs.323910 
Hs.175563 
Hs.200102 
Hs.296506 
Hs.202676 
Hs.204096 
Hs.38218 . 
Hs.132921 
Hs.44836 
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302892 AW176909 Hs.42346 calrineunn-binding protein calsarcin-1 3.4 

302970 W05608 Hs.312679 ESTs, Weakly similar to A49019dyneln he 5.1 

303271 AA652687 Hs.96151 Human DNA sequence from clone RP5-1 103G7 3.7 

303289 AL121460 Hs.272673 hypothetical protein FU20508 4.1 

5 303357 AW005352 Hs.159643 ESTs, Weakly similar to T32554 hypothetl 4.2 

303540 AA355607 Hs.309490 ESTs. Weakly similar to putative WHSC1 p 4.3 

303563 AA367699 Hs.10082 potassium Inte/mediate/small conductance 3.3 

303642 AW299459 gbDcs50d08j(1 NCI_CGAP_Kid11 Homosaplen 4.2 

303780 A1424014 Hs.18995 K1AA1 304 protein 3.6 

10 303797 AW629759 gb:hh70eQ5.y1 NCI CGAP_GU1 Homo sapiens 4.9 

303852 R53434 Hs.90207 hypothetical protein MGC1 1 138 3.7 

304328 M149951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h08.s1 NCI CGAP Gas 1 Homo sapiens 4.1 

305913 AA876109 gb:nx24hQ1.s1 NCLCGAP GC3 Homo sapiens 3 

15 305917 AA876469 gb:oe48b04.st NCLCGAP_Pr25 Homo sapiens 3.1 

307010 AI140014 gb:qa68f09.x1 SoaresJetolhearLNbHH19W . 3.5 

- 307041 A1144243 gb:qb85b12jc1 SoaresJetaLheartNbHH19W 3.9 

308106 AI476803 gb^77e1Zx1 Soares_NSF_F8 JW_OTJ>AJ> S 4.3 

308307 A1581398 Hs.172928 collagen, type I, alpha 1 4.6 

20 308615 AK000142 Hs.101774 hypothetical protein FU23Q45 4.4 

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3 

309328 AW024348 Hs.233191 EST, Weakly similar to A27217 glucose tr 3.2 

309574 AW16B083 gb:xg59g04.x1 NCLCGAP JJt4 Homo sapiens 3.1 

309583 AW170O35 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6 

25 310064 AI199712 Hs. 148486 ESTs, Weakly similar to 191721 OA Pro/Arg 4.6 

310098 A1685841 Hs. 161 354 ESTs ~ 3.6 

310438 AW022192 Hs.200197 ESTs 4.6 

310683 AI939456 Hs.16087Q ESTs 3.2 

310727 AK000703 Hs.323822 Homo sapiens mRNA for KlAA 1551 protein, 3.6 

30 310781 AJ380797 Hs.158992 ESTs 10.2 

310895 AI955121 Hs.165724 N-acetylgalactosamine-4-O-sulfotransfera 3.4 

310955 AI476732 Hs.263912 ESTs 10.9 

311117 AI671439 Hs.196029 Homo sapiens mRNA for KIM1657 protein, 3.1 

311166 AI8210O5 Hs.1 18599 ESTs 10.8 

35 31 1237 AA641098 Hs.208809 ESTs, Moderately similar to ALU1 JHUMAN A 4.3 

311465 AJ758660 Hs.206132 ESTs 4.4 

311587 AI828254 Hs.271019 ESTs, Weakly similar to A47582 B-cellgr 5.1 

311598 AW023595 Hs.232048 ESTs 58 

311774 AA700870 Hs.14304 ESTs 3.3 

40 311785 AI056769 Hs,133512 ESTs 3.9 

311872 R12375 Hs.194600 ESTs 3.3 

311889 AA767342 Hs.122483 ESTs, Weakly similar to PSF_HUMAN PTB-AS 3 

311913 AI358522 Hs.270188 ESTs 3 

311923 T60843 Hs.189679 ESTs 5.6 

45 311935 AA216387 gb:nc16b02.s1 NCI CGAP JV1 Homo sapiens 5.2 

312019 AA373630 Hs.188750 ESTs 3 

312021 AA759263 Hs.14041 ESTs 3.4 

312067 T78968 Hs.14411 ESTs 3.5 

312090 T80177 Hs.118Q64 similar to rat nuclear ubiquitous casein 3.8 

50 312147 AI633744 Hs.195648 ESTs, Weakty similar to 138022 hypothefi 4.4 

312153 BE261944 Hs.118625 hexokinasel 5.2 

312168 T92251 Hs.198882 ESTs 33 

312182 T94344 Hs,326263 ESTs 3.3 

312187 AA700439 Hs.188490 ESTs 34 

55 312199 AW438602 Hs.191179 ESTs 39 

312219 H73505 Hs.1 17874 ESTs 4 

312226 AA315703 Hs.199993 ESTs, Weakly similar to ALUB_HUMAN 111! 4.9 

312299 AA972712 Hs.269737 ESTs 5.7 

312544 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypothetl 6.3 

60 312638 AW439195 Hs.256880 ESTs, Weakly similar to S65657 a!pha-1C- 4.9 

312826 AW291545 Hs.185018 ESTs 4.9 

312837 AW292286 Hs.255058 ESTs 4.4 

312980 AA497043 Hs.115685 ESTs 3.1 

313070 A1422023 Hs.161338 ESTs 4.3 

65 313079 N76497 Hs.1787 proteollpld protein 1 {Pellzaeus-Merzbac 3.3 

313089 AF026944 Hs.293797 ESTs 5.8 

313096 AW073310 Hs.163533 Homo sapiens cONA FU14142 fis, clone MA 4.5 
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313126 AA7465Q3 Hs.283313 ESTs 10 

313166 AI801098 Hs.151 500 ESTs 3 5 

313197 AW979008 Hs.222487 ESTs 33 

313280 AW960454 Hs.222830 ESTs 47 

5 313325 AI420611 Hs.127832 ESTs 34 

313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4 

313352 AW150945 Hs.144758 ESTs 4.1 

313385 AI032087 Hs.269819 ESTs 3 

313393 A1674685 Hs.200141 ESTs 52 

10 313417 AA741151 Hs.137323 ESTs 3.5 
313434 W92070 gb:zh48gQ5j1 Soares fetaUwer_spleen 3.7 

313569 AI273419 Hs.135146 hypothetical protein FU 13984 3 
313591 AA046309 gbaf12f01.s1 Soares fetal heajt_NbHH19W 5.6 

313615 AI540978 Hs.301997 hypothetical protein RJ13033 3.2 

15 313915 C18863 Hs. 163443 Homo sapiens cDNAFU 11576 fc, done HE 263 

313975 AW175896 Hs.65114 keratin 18 3 

313979 AI535895 Hs.221024 ESTs 4.9 

313997 AV657317 Hs.288649 hypothetical protein MGC3077 3.9 

314043 AA827082 Hs.291872 ESTs 3.1 

20 314078 AW129357 Hs.329700 ESTs 8.3 

314097 AA648744 Hs.269493 ESTs 6.6 

314121 AI732083 Hs.187619 ESTs 6.2 

314129 AA228366 Hs.115122 ESTs 4 
314138 AA740616 gb:ny97f11.s1 NCLCGAPJ5CB1 Homo sapiens 5.9 

25 314236 AA743396 Hs.189023 ESTs 3.1 

314244 AL036450 Hs.103238 ESTs 4 

314305 AI280112 Hs. 125232 Homo sapiens cONAFU13266fc, done OV 8 

314306 AI697901 Hs.192425 ESTs 3.7 
314322 AA907153 Hs.190060 ESTs 3.3 

30 314394 AW961597 Hs.130816 ESTs, Moderately similar to 138022 hypot A2 

314401 AI660412 Hs.234557 ESTs 3.3 

314465 AA602917 Hs.156974 ESTs 4.7 

314506 AA833655 Hs.206868 Homo sapiens cDNA FU14056 lis, done HE 8.5 

314510 AI204418 Hs.190080 ESTs 4 

35 314546 AW007211 Hs.16131 hypothetical protein FU12876 3.4 

314547 AA399272 Hs.144341 ESTs 6.7 

314558 AI673274 Hs.190721 ESTs 27.4 

314627 AA425310 Hs.155766 ESTs, Weakly simitar to A47582 B-cefl gr 4.4 
314648 AW979268 gb£ST39 1378 MAGE resequences, MAGP Homo 4.6 

40 314691 AW207205 Hs.136319 ESTs 207 

314729 AA457367 Hs.191638 ESTs 36 

314754 AW026761 Hs.134374 ESTs 3.6 

314814 BE350122 Hs.157367 ESTs, Weakly similar to (78885 serine/th 4.9 

314864 AW971198 Hs.294068 ESTs 4.3 

45 314881 AI095087 Hs. 1 52299 ESTs, Moderately similar to S65657 alpha 3.7 

314882 AA828032 Hs.189076 ESTs 3 1 

314981 AW972359 Hs.293334 ESTs 3 

315006 A1538613 Hs.298241 Transmembrane protease, serine 3 10.9 

315021 AA533447 Hs.312989 ESTs 53 

50 315051 AW292425 Hs.163484 ESTs 1Z9 

315060 AA551104 Hs.189048 ESTs. Moderately similar to ALUCJiUMAN ! 5.8 

315073 AW452948 Hs.257631 ESTs 42 

315080 AA744550 Hs.136345 ESTs 37 

315175 A1025842 Hs.152530 ESTs 6 

55 315183 AW136134 Hs.220277 ESTs 39 

315193 AI241331 Hs.131765 ESTs, Moderately similar to I38937 DNA/R 4^4 

315196 A1367347 Hs.44898 Homo sapiens clone TCCCTA001 51 mRNA sequ 8.2 

315198 AI741506 Hs.165900 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.6 

315240 R38772 Hs.172619 myelin transcription factor 1* 3 4 

60 315263 AW5 10994 Hs.220740 ESTs 34 

315282 A1222165 Hs.144923 ESTs 49 

315296 AA876905 Hs.125286 ESTs 4' 

315368 AB037745 Hs.104696 K1AA1 324 protein 47 

315397 AA218940 Hs.137516 fidgetin-Gke 1 3*1 

65 315489 A1378817 Hs.191847 ESTs 3*1 

315498 AA628539 Hs.116252 ESTs, Moderately similar to ALU 1.HUMAN A 3^2 

315526 A1193043 Hs.128685 ESTs, Weakly similar to T17226 hypotheti 4,1 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



315530 AW015415 


Hs.127780 ESTs 


8.9 


315562 AA737415 


Hs.1 52826 ESTs 


5.5 


315634 AA8370B5 


Hs.220585 ESTs 


6.3 


315647 AA648983 


Hs.212911 ESTs 


3.6 


315707 AI4 18055 


Hs. 16 1160 ESTs 


5.1 


315772 AW515373 


Hs.271249 Homo sapiens cONA FU13580 fis, done PL 


3.1 


315850 AW270550 


Hs.1 16957 ESTs 


3.8 


315858 AA737345 


HS.294041 ESTs 


5 


315878 AA683336 


Hs.189046 ESTs 


3.1 


315977 AW865916 


Hs. 151206 ESTs 


4.7 


315978 AA830893 


Hs.119769 ESTs 


4.1 


315995 AI217477 


Hs.194591 ESTs 


4.1 


316012 AA764950 


Hs.119898 ESTs 


7 


316042 AI469960 


Hs.170698 ESTs 


4.9 


316052 A1962796 


Hs.136754 ESTs 


4.1 


316072 AW517524 


Hs.135201 NOD2 protein 


3.2 


316074 AW975114 


Hs.293273 ESTs 


3.8 


316100 AW203986 


Hs.213003 ESTs 


3.2 


316133 A1187742 


Hs.125562 ESTs 


3.7 


316177 AI904982 


Hs.293102 ESTs. Moderately similar to ALU1_HUMAN A 


30.7 


316186 AI433540 


gb:ti69g05.x1 NCLCGAP_Kld11 Homosapien 


3.1 


316244 A1640761 


Hs.224988 ESTs 


3.5 


316303 AA740994 


Hs.209609 ESTs 


3.8 


316313 AA741300 


Hs.202599 ESTs, Weakly similar to 138022 hypotheti 


4.4 


316364 AA747807 


Hs. 149500 ESTs 


3.2 


316580 AA938198 


Hs.146123 poly(A) polymerase gamma 


9.4 


316697 AW293174 


Hs.252627 ESTs 


4.4 


316715 A1440266 


Hs.170673 ESTs, Weakly similar to T24832 hypotheti 


3 


316868 AI660898 


Hs.195602 ESTs 


3.2 


316869 AI954880 


Hs.134604 ESTs 


3.2 


316886 AA836331 


Hs.134981 ESTs 


4^4 


316897 AA838114 


Hs.221612 ESTs 


3.7 


316943 AW014875 


Hs.1 37007 ESTs 


4.6 


317069 AI732892 


Hs.1 90489 ESTs 


5.9 


317194 AW445167 


Hs.1 26036 ESTs 


4.1 


317360 AJ125252 


Hs.126419 ESTs 


3.5 


317404 AI806867 


Hs.1 26594 ESTs 


5.1 


317452 AA972965 


Hs.1 35568 ESTs 


6.9 


317501 AI822034 


Hs. 137097 ESTs 


4.6 


317674 AW294909 


Hs.1 32208 ESTs 


4.3 


317803 AW664964 


Hs.128899 ESTs 


6.1 


317834 X56348 


Hs.287270 ret proto-oncogene (multiple endocrine n 


3.1 


317850 AI681545 


Hs.152982 hypothetical protein FU13117 


3.4 


317881 AI827248 


Hs.224398 Homo sapiens cONA RJ1 1469 fls, clone HE 


9.6 


317902 AW102941 


Hs.211265 ESTs 


4.1 


317916 AJ565071 


Hs.159983 ESTs 


10.3 


318042 AW294522 


Hs.149991 ESTs 


3.1 


318223 AI077540 


Hs.134090 ESTs 


3.9 


318327 AW294013 


Hs.200942 ESTs 


3 


318332 AI093930 


Hs.163440 Homo sapiens cDNA: FU21000 fis, done C 


4.4 


318418 AF107493 


Hs.1 18498 Homo sapiens LUCA-15 protein mRNA,spiic 


5.4 


318558 AW402677 


Hs.146381 RNA binding motif protein, X chromosome 


4.4 


318625 AA526235 


Hs.193162 Homo sapiens cDNA FU1 1983 fis, clone HE 


5.9 


318634 T49598 


Hs.156832 ESTs 


4 


318740 NMJ102543 


Hs.77729 oxidised low density lipoprotein (lectin 


7.3 


318744 AI793124 


Hs.144479 ESTs 


17.8 


318781 F11802 


Hs.6818 ESTs 


3 


319191 NM012391 


Hs.794 14 prostate epithelium-specific Els transcr 


3.6 


319478 AI524124 


Hs.270307 ESTs 


4.6 


319510 W88532 


Hs.254562 ESTs 


3.3 


319551 AA761668 


gb:nz24c08.s1 NCLCGAP_GCB1 Homo sapiens 


3.2 


319745 T79366 


Hs.108258 actln binding protein; macrophin (mlcrof 


3.3 


319834 AA071267 


gb:zm61g01.r1 Stratagene fibroblast (937 


6.2 


319840 C19035 


Hs.164259 ESTs 


3.3 


319977 AA534222 


gb:nj21d02.s1 NCLCGAPAA1 Homo sapiens 


4.3 


320074 AA321166 


Hs.278233 ESTs 


3.4 


320167 AA984373 


Hs.90790 Homo sapiens cDNA: FU22930 fis, clone K 


4.1 
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320187 T99949 Hs.303428 Homo sapiens cDNARJ14832fc, done OV 5.3 

320211 AL039402 Hs.125783 DEME-6 protein 9.2 

320416 A1026984 Hs.293662 ESTs 3.1 

320588 U78082 Hs. 167738 RNA polymerase Jl transcriptional regula 3.1 

5 320635 N5Q617 . Hs.80506 smaD nuclear ntonucleoprotein potypept 6.1 

320654 AI160015 Hs. 1181 12 ESTs 3.5 

320742 AI601188 Hs.120910 ESTs 3 

320832 AA214584 Hs.290167 ESTs 3.7 

320915 AI359144 Hs.143688 Homo sapiens cONA: FU23Q31 fis, done L 3.1 

10 321016 BE144167 Hs.49994 hypofoetical protein similar to RNA-bind 3.3 

321107 AI732643 Hs.144151 ESTs 12,3 

321171 AI769410 Hs.221461 ESTs 3.3 

321253 AA610649 Hs.333239 ESTs 3 

321318 AB033041 Hs.137507 vang {van gogh, DrosophilaHke 2 3.9 

15 321642 A1432199 Hs.247084 ESTs 3 

321644 AW975944 Hs.237396 ESTs 11.7 

321683 A1471598 Hs.197531 ESTs 3.8 

321758 U29112 Hs.196151 ESTs 4.4 

321811 080630 gb.HUM091D02B Human fetal brain (TFujrwa 3.2 

20 321828 R59890 Hs.83623 nuclear receptor subfamily 1, group I, m 3.1 

321910 H67065 Hs.271530 ESTs, Weakly similar to ALU7 HUMAN ALU S 4.7 

321937 AL049351 Hs.302058 Homo sapiens mRNA; cDNA DKFZp566C093 (fr 3.5 

321978 N77342 Hs.21851 Homo sapiens cDNAFU 12900 lis, done NT 5 

322035 AL137517 Hs.334473 hypothetical protein DKFZp56401278 19 

25 322136 AF075083 gb^tomo sapiens full length insert cONA 3.6 

322258 BE265745 Hs.194359 ESTs, Weakly similar to ALUCJtUMAN Ull 3 

322296 W76326 gb:zd60d04ri SoaresJetaLhearLNbHH19W 4.4 

322303 AI357412 Hs.157601 ESTs 11.5 

322476 AW963372 Hs.46677 PRO2000 protein 3 

30 322520 T55958 gb.yb35R35.r1 Sfratagene fetal spleen (9 3 

322521 AF147347 gb:Homo sapiens full length insert cONA 4.2 

322567 AF155108 Hs.256150 Homo sapiens, Similar to RIKEN cONA 2810 4 

322595 W92147 Hs.1 18394 ESTs 5.4 

322675 AA017656 gb:ze39h01.r1 Soares retina N2b4HR Homo 3.1 

35 322766 AW068805 Hs.288467 Homo sapiens cONA FU12280 fis, done MA 5.2 

322818 AW043782 Hs.293616 ESTs 7.6 

322882 AW248508 Hs.279727 Homo sapiens cONA FU 14035 fis, done HE 5.9 

322975 C16391 gb:C16391 Ctontech human aorta polyA mRN 16.5 

323091 AI902456 Hs.210761 ESTs, Weakly similar to I38022 hypotheti 4 

40 323131 AK002088 Hs.2701 24 Homo sapiens cDNA RJ1 1226 fis. done PL 3.3 

323168 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 6.3 

323244 AW675572 Hs.193620 ESTs 4.6 

323262 AL133990 Hs.190642 ESTs 10.5 

323332 AI829520 gb:wi19c06.x1 NCI,CGAP_Ut1 Homo sapiens 6.2 

45 323333 AV651680 Hs.208558 ESTs 4.3 

323335 AI655499 Hs.161712 ESTs 9.2 

323645 AW445014 Hs.197746 ESTs 3,1 

323663 BE081058 Hs.243023 ESTs 4 

323693 AA317962 Hs.249721 ESTs, Moderately similar to PC4259 ferri 3 

50 323782 AW961560 Hs.97600 ESTs 3.2 

323817 AA410943 BMP-R1B 8.4 

323930 AL043683 Hs.8173 hypothetical protein RJ1O803 3.3 

323974 AI825204 Hs.211408 ESTs 4.5 

324001 AL044949 Hs.116298 ESTs 4.5 

55 324036 AI472078 Hs.303662 ESTs 8.4 
324261 BE069341 gb:QV3-BT038 1-270100473-008 BT0381 Homo 49.4 

324285 AA431159 Hs.122954 ESTs 3 

324296 AI524039 Hs.192524 ESTs 3 

324305 AA642007 Hs.1 16369 ESTs 3.3 

60 324432 AA464510 Hs.152812 ESTs 165 

324585 A1823969 Hs.132678 ESTs 3.3 

324598 AW972227 Hs.163986 Homo sapiens cDNA: FU22765 lis, done K 5 

324603 AW993522 Hs.292934 ESTs 10.4 

324631 AA937116 Hs.293683 ESTs, Weakly similar to I54374 gene NF2 3.3 

65 324716 BE169746 Hs.12504 likely ortholog of mouse Arkadia 3.2 

324748 AW974941 Hs.292385 ESTs. WeaWy similar to I78885 serine/lh 3 

324771 AA631739 Hs.335440 EST 3 
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324774 
324823 
324824 
324826 
5 324961 
324987 
324994 
325146 
325372 

10 325544 
327075 
332798 
334223 
334447 

15 335809 
335824 
338255 
409430 
428046 

20 432558 
436808 
448569 
453542 

25 



30 



AI031771 

AW516704 

AI826999 

AA704806 ■ 

AA613792 

AI375572 

AI805416 

AI064690 
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40 



45 



50 



55 



60 



65 



R21945 

AW812795 

R97268 

AA731602 

BE382657 

AW836724 

M97935 

M97935 

M55150 

M13755 

AI052047 

AA252033 

AA401739 

H18459 

R48744 

M31682 

AA4 16873 

D80240 

R49590 



X03363 



Hs.132586 
Hs.208726 
Hs.224624 
Hs.143842 

Hs.172634 
Hs.213897 
Hs.171176 



Hs.166975 
Hs.155381 
Hs.177269 
Hs.120266 
Hs.21486 
Hs.339660 



AA034918 



AF049569 
M13955 



HG4126HT4396 



ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to 2004399A chromos 

gb:no97h03.s1 NCI_CGAP_Pr2 Homo sapiens 

ESTs 

ESTs 

ESTs 

Phase 2 & 3 Exons 
Phase 2 & 3 Exons 
Phase 2& 3 Exons 

C22000007^ij12314195|emb|CAB99338.1| (A 
NM_0G5Q80*:Homo sapiens X-box binding pr 
NM_012429*:Homo sapiens SEC14 (S. cerevl 
NM_014509*:Homo sapiens kraken-llke (BK1 
ENSP00000249072*:DJ222E13.1 (N-TERM1NAL 
NMJ)14323*;Homo sapiens zinc finger prot 
splicing factor, arglnine/serine-rich 5 
ESTs, Moderately similar to (36022 hypot 
ESTs 
ESTs 

signal transducer and activator of trans 

Homo sapiens mRNA expressed only In plac 

AFFX control: STAT1 

AFFX control: STAT1 

fumarylacetoacetate 

interferon stimulated protein; 15 kDa 

ESTs 

ESTs; Weakly similar to ALU SUBFAMILY J 
ESTs 

hepatocellular carcinoma associated protein; 
ESTs 

inhibin; beta B (activin AB beta polypeptide) 
ESTs 

HUM5G11A Human fetal brain (TFujiwara) Homo 
ESTs 

CH22_FGENES.678_5 

CH22.FGENES.619J 

CH22_FGENES.619_12 

CH22_EM^C005500.GENSCAN.127 9 

CH22.EM:AC005500.GENSCAN.304 2 

CH22.FGENES.271J 

CH22JGENES.619J3 

CH22_FGENES.27U 

CH22_FGENES.617_7 

CH.07Jisgi|6004473 

CH22JGENES.264J 

HER2 receptor tyrosine kinase (c erbB 2; ERBB2; 

CH22_FGENES.617_9 

CH.07_hsgil5868264 

CH.19_hs gi|5867439 

CH22 FGENES.6 3 

CH.17 hsgp67230 

CH.20 hsgi[6552458 

CH22 EMAC005500.GENSCAN.14822 

CH22_FGENES.669J0 

KIAA1028 protein 

CH22 FGENES.48 12 

CH22_FGENES.118_2 

ESTs 

multiple UniGene matches 
CH22 FGENES.619_8 
CH22.FGENES.13 7 

CH22 FGENES.360J 
CH22 FGENES.706 9 
CH.21 hsgii6531965 



4.2 

3.4 

3.1 

4.4 

3.9 

18.8 

3.3 

4.2 

4.4 

57 

3.8 

4.3 

26.2 

3.9 

10.1 

20 

9 

4 

4.6 

3.2 

3.9 

4.1 

3.7 

3.2 

3 

3 

4.5 
6.7 
3.2 
3.3 
3 

4.2 
3 
3 
4 

3.2 
16.8 
12.9 
11.3 
9.2 
8.5 
8.4 
8 

7.3 
7.2 
7.1 



6.6 



neu) 
6.5 
5.8 
5.7 
5.3 
5.1 
5.1 
4.7 
4.6 
4.6 
4.5 
4.5 
4.4 
4.3 
4.3 
4.3 

Zinc Finger Protein Hzf4 

4.3 

4.3 

4.2 



4.3 
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HG2614HT2710 
X83535 

HG4716HT5158 
HG4677HT5102 

AA7 14311 

HG2465HT4871 
AA707750 



AA976074 



CR17_hsgi[5867215 
CH22_FGENES.669_8 

CH22_FGENES.48J8 

matrix metalloproteinase 14 (membrane Inserted) 
CH22_FGENES.27U 
CH22.FGENES.617J 
CH22_FGENES.290_8 

CH22LFGENES.135 
CH22.FGENES.132 
CH.14_hsgi[6682474 
CH.02Jisgi|5867750 
CH22_FGENES.617J 

CH22_DJ32I10.GENSCAN.23 39 
CH22_FGENES.543_20 
CH22_EM:AC005500.GENSCAN.96 1 
CH22_FGENES.204 2 
CH22__FGENES.619jl 
CH.16Jisgi[5867087 
EST duster (not in UniGene) 
CH22_EfAAC005500.GENSCAN.149 9 
CH22_EM:AC005500.GENSCAN.421 5 
CH22_FGENES.134 
CH.07_hsgij6004478 
CH22_FGENES.360J 

CH22_FGENES.6_2 

CH22_C20H12.GENSCAN.16 2 

CH22_C65E1.GENSCAN.81 

ESTs; Weakly similar to cis Golgi matrix 

CH22_FGENES.307_4 

CH22^EM^C005500.GENSCAN^48 14 

CR06Jtsgi|5902482 

CH22.FGENES.669 5 

CH22_DJ32I10.GENSCAN.19 8 

CH2JLFGENES.527-.6 

CH22^FGENES.330J0 

CH22LFGENES.14 2 

ESTs 

CH22_FGENES^26 7 
CH22.FGENES.13 3 
CH22_EM:AC005500.GENSCAN.209 12 
CH22_FGENES.271_3 



4.1 
4.1 

Collagen, Type VUi. Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanosine 5' Monophosphate Synthase 3.8 

3.8 

3.8 

3.8 

3.8 

3.7 

Oncogene RetfPtc^ Fusion Activated 3.7 

3.7 

3.7 

3.7 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 

3.3 

Ona Binding Protein Ap 2, Aft Splice 3 3 .3 

3.3 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3,1 

3.1 

3.1 

3.1 

3.1 

3 

3 

3 

3 

3 
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TABLE 17A 



10 



Table 17A shows the accession numbers for those pkeys lacking unigenelD's for Table 17. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 
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Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

116845 393481J AA649530 AA659316 H64973 
103207 30635.-4 X72790' 

126257 182217J N99538 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 

102791 37186J AF080229 AF080231 AF08Q230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 A1680833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 M463992 AW206802 A1970376 AI58371 8 AI672574 
N25695 AW665466 AI818326 M126128 AW80345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 AKJ80480 AI631703 AI651023 AI867418 AW818140 AA502500 AI2061 99 AI671282 AI352545 BE501030 
A1652535 BE465762 AA206331 AW451866 AA471 088 AA206342 AA204 834 AA206 100 AW02 1661 AA332922 N66048 
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE466611 AI206344 AA574397 
AA348354 AI493192 

126872 142696J AW450979 AA136653 M136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE011359 
BE011367 BE011368BE011362 BE011215 BE01 1 365 BE01 1363 
1746257 J R82040R70934- 
176835.1 AA225084AA302713 

324239J A1311928 AA936030 T51931 AA609816 AA487195 AA664207 
genbanK.F09609 F09609 
genbanK_R08440 R08440 



112631 
120742 
106864 
109700 
111532 
113938 
113947 
124357 
108733 
112303 
322136 
322296 
321811 
314648 
322520 
322521 
322675 
323332 
316186 
322975 
324261 
323817 
301976 
324961 
303642 
303797 
319551 
311935 
319834 
319977 
314138 
313591 



genbank.W81598 W81598 
genbank_W84768 W84768 
genbank.N22401 N22401 
504187J AA121022AA126422 
genbank_R54797 R54797 
46802 J AF075083 H52291 H52528 
47334 J W76326 AF086341 W72300 
1527481.1 D80630 080896 080895 
293660 1 AW979268AA878419AA431342AA431628 
38916.1 T55958 T57205 AF147346 
38917 J AF147347 T55426 T55503 
86787.1 AA017656AA017374AA019761 

179142.1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315 
425440 1 A1433540AA728984AA8O4981 
1510563.1 C16391 C16413 

273265.1 BE069341 AW748403 AL044891 AI908240 AA393080 

233566.1 AA410943 AW948953 AA334202 AA332882 

128835.1 T97905AA101672 

376239.1 AA613792 AW182329 T05304 AW858385 

284260.1 AW299459 AA4171 12 

386364.1 AW629759 AW749955 AA633408 AI651005 

357371 J M761668 AA573621 R92814 R09670 

174129.1 AA216387 T63548 AA228676 

1 12523.1 AA071267 T65940 T64515 AA071334 

345248.1 AA534222 AA632632 T81234 

179960.1 AA740616 AA654854 AA229923 

103087.1 AA046309 AI263500 AA046397 
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308106 AJ476803 

338255 CH22J856FG_UNK_EM:AC00 
335809 CH22_3181FG,617_6 UNK.EM 
335824 CH22J5197FG_619J? UNK_E 
5 307010 AI140014 
307041 AM44243 
305913 AA876109 
305917 AA876469 
309574 AW168083 
10 325372 c12_h$ 
325544 c12Jis 

332798 CH22J4FG.6 5 L1NK_C4G1.G 
334223 CH22J507FGJJ60 4 L1NK_EM 
327075 c21Jis 
15 334447 CH22J746FG 387 7JJNICEM 
304782 AA582081 

313434 441798J W92070 AW019952 W92053 
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TABLE 17B 

Table 17B shows the genomic positioning for those pkeys lacking unigene CD's and 
5 accession numbers in Table 17. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

1 0 Ret: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLpositton: Indicates nucleotide positions of predicted exons. 



15 



Pkey Ref Strand NLposrtfon 



334447 Dunham, LetaL Plus 14308764-14308824 

335809 Dunham, I. eUl. Plus 26310772-26310909 

20 335824 Dunham, I. etal. Plus 26376860-26376942 

332798 Dunham, I. etal. Minus 232147-231974 

334223 Dunham, I. etal Minus 12734365-12734269 

338255 Dunham, LetaL Minus 15242294-15242231 

325372 5866920 Minus 111706M117304 

25 325544 6682452 Plus 171228-171286 

327075 6531965 Plus 40413184041431 
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TABLE 18: Table 2 from BRCA 014 P 

Table 18 shows genes with atleast five times the expression in breast tumor tissue than is 
5 expressed in normal body tissues. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 Un'genelD: Unigene number 

Unigene Title: Unigene gene title 



R1: 



Ratio of tumor to normal body tissue 



15 Pkey ExAccn UnigenelD Unigene Title 



R1 



101378 BE563085 Hs.833 
101530 M29874 Hs.1360 
101767 M81057 Hs.180884 
20 101878 M97815 Hs.183650 
103010 X52509 Hs.161640 
104115 AF183810 Hs.26102 
104825 AA035613 Hs.141883 
107105 AW963419 Hs.155223 
25 108819 AA011449 Hs.271627 
112287 AB033064 Hs.334806 
112561 A1791493 Hs.129873 
112637 R82331 Hs.164599 
113206 BE262470 Hs.241471 
30 113970 W27249 Hs.8109 
114965 AI733881 Hs.72472 
118925 N92293 Hs.206832 
119905 AW449064 Hs.1 19571 
121611 M31669 Hs.1735 
35 129301 AF182277 Hs.330780 
133976 A1908165 Hs.169946 
134731 D89377 Hs.89404 
300254 AW183618 Hs.55610 
301884 AA312082 Hs.105445 
40 302001 AB020711 Hs.278346 
302067 BE542706 Hs.222399 
302276 AW057736 Hs.323910 
302290 M179949 Hs.175563 
302372 AL1 17406 Hs.2O01O2 
45 302385 AJ224172 Hs.204096 
309177 A1951118 Hs.326736 
309583 AW1 70035 Hs.326736 
310781 A1380797 Hs.158992 
311166 AI821005 Hs.118599 
50 311935 AA216387 

312153 BE261944 Hs.1 18625 
313328 AW449211 Hs.105445 
313915 C18863 Hs.163443 
314097 AA648744 Hs.269493 
55 314138 AA740616 

314506 AA833655 Hs.206868 
314558 A1873274 Hs.190721 
314691 AW207206 Hs.136319 
315006 A1538613 Hs.298241 
60 315021 AA533447 Hs.312989 
315051 AW292425 Hs. 163484 
315060 AA551104 Hs.189048 



interferon-sfimubted protein, 1 5 kDa 5.3 

cytochrome P450, subfamily IIB (phenobar 9 

carboxypepSdase B1 (tissue) 1 2 

cellular retinoic acid-binding protein 2 6.5 

tyrosine aminotransferase 12.4 

opposite strand to trichominophalangeal 7.6 

ESTs 6.9 

stanniocalcin 2 5,3 

ESTs 6.1 

KIAA1238 protein 7.3 

ESTs, Weakfy similar to A36036 cytochrom 8.2 

ESTs 5.4 

RNB6 6.2 

hypothetical protein FU21Q80 6.9 

BMP-R1B 10.1 

ESTs, Moderately simi/ar to ALU8_HUMAN A 19.7 

collagen, type III. alpha 1 (Ehlers-Oanl 8.4 

inhibin, beta B (activtn AB beta polypep 5.6 

cytochrome P450, subfamily IIB (phenobar 6.2 

GATA-binding protein 3 (T-ceH receptor 6.2 

msh(Drosophila)homeoboxhomolog2 5.8 

solute carrier family 30 (zinc transport 9.9 

GDNF family receptor alpha 1 5.7 

K1AA0904 protein 7.7 

CEGP1 protein 7.3 

HER2 receptor tyrosine kinase (c-erb-b2. 5.4 
Homo sapiens mRNA; cONA DKFZp564N0763 (f 34.1 



ATP-binding cassette transporter MRP8 
llpophiiin B (uteroglobin family member) 
Homo sapiens breast cancer antigen NY-8R 
Homo sapiens breast cancer antigen NY-8R 
ESTs 
ESTs 

gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 
hexokinase 1 

GDNF family receptor alpha 1 
Homo sapiens cDNA FU11576 fis, done HE 
ESTs 

gbmy97f11.s1 NCLCGAP_GCB1 Homo sapiens 5.9 
Homo sapiens cONA FU 14056 /is. clone HE 8.5 
ESTs 
ESTs 

Transmembrane protease, serine 3 
ESTs 
ESTs 

ESTs, Moderately similar to ALUCLHUMAN 1 



6.7 
13.8 
17.3 
57.6 
10.2 
10.8 
5.2 
5.2 
12.4 
26.3 
6.6 



27.4 
20.7 
10.9 
5.3 
12.9 
5.8 
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315196 
315530 
315634 
316012 
316177 
316580 
317803 
317881 
318740 
318744 
320211 
321107 
321644 
321978 
322035 
322766 
322818 
322975 
323262 
323332 
323817 
324261 
324432 
324598 
324603 
324987 
325544 
330388 
334223 
335809 
335824 



AI367347 Hs.44898 
AW015415 Hs.127780 
AA837085 Hs.220585 
AA764950 Hs.1 19898 
AI904982 Hs.293102 
AA938198 Hs.146123 
AW664964 Hs. 126899 
AI827248 Hs.224398 
NM.002543HS.77729 
A1793124 Hs.144479 
AL0394O2 Hs.125783 
AI732643 Hs.144151 
AW975944 Hs.237396 
N77342 Hs.21851 
AL137517 Hs.334473 
AW068805 Hs.288467 
AW043782 Hs.293616 
C16391 

AL133990 Hs.190642 
AI829520 
AA410943 
BE069341 

AA464510 Hs.152812 
AW972227 Hs.163986 
AW993522 Hs.292934 
AI375572 Hs.172634 

Hs,4B 



A1052047 
R72427 



Homo sapiens done TCCCTA00151 mRNA sequ8.2 
ESTs 8.9 
ESTs 6,3 
ESTs 7 
ESTs, Moderately similar to ALU1_HUMAN A 30.7 
poly(A) polymerase gamma 9,4 
ESTs 6.1 
Homo sapiens cDNA FU1 1469 fis, clone HE 9.6 
oxidised low density lipoprotein (lectin 7.3 
ESTs 17.8 
DEME-6 protein 9.2 
ESTs 12.3 
ESTs 11.7 
Homo sapiens cONA FU12900 fis, clone NT 5 
hypothetical protein DKF2p56401278 19 
Homo sapiens cONA FU12280 fis, clone MA 5.2 
ESTs 7.6 
gb:C16391 Clontech human aorta pofyA mRW 16.5 
ESTs 105 
gbw!19c06.x1 NCLCGAP^Utl Homo sapiens 6.2 
BMP-R1B 8.4 
gb:QV3-BT0381-270100-073-c08 BT0381 Homo 49.4 

165 
5 

10.4 
18.8 
5.7 
6.6 
26.2 
10.1 



ESTs 

Homo sapiens cONA: FLJ22765 fis, clone K 
ESTs 
ESTs 

Phase2&3Exons 

HER2 receptor tyrosine kinase (c-erb*b2, 
NMJ)05080*:Homo sapiens X-box binding pr 
NM_014509*:Homo sapiens kraken-iike (BK1 
ENSP00000249072*:DJ222E13.1 (N-TERMINAL 

ESTs; Weakly similar to CYTOCHROME P450 6.7 

CH22_EMACO055OO.GENSCAN.127 9 5.5 

CH22_FGENES.619 13 9.2 

CH22_FGENES.617 9 8 

CH22_FGENES.271 7 6.5 

CH22J : GENES.619J 7.3 

CH22J : GENES.271_8 12.9 

CH22_FGENES.619_12 8.4 

CH22_EM:AC005500.GENSCAN.304 2 1 1.3 

CH.07_hsgl|6004473 8.5 

CH22J r GENES.617_7 7.1 

CH22JFGENE$.678_5 7.2 

CH22_FGENES.678__5 16.8 
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TABLE 18A 

Table 18A shows the accession numbers for those pkeys lacking unigeneDD's for Table 18. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 



323332 179142J AI829520AJ791832AA228414 AI791823AA229211 AA229315 

20 322975 1510563J C16391 C16413 

324261 273265 J BE069341 AW748403 AL044891 AI908240 AA393080 

323817 233566J AA410943AW948953AA334202AA332882 

311935 174129J AA216387T63548AA228676 

314138 17996QJ AA740616 AA654854 AA229923 

25 335809 CH22J181FG 617 6 UNK_EM 

335824 CH22_3197FG_619 11_L!MK_E 

325544 d2_hs 

334223 CH22J507FGJJMJJJNK-EM 
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TABLE 18B 

Table 18B shows the genomic positioning for those pkeys lacking unigene ID 's and 
5 accession numbers in Table 18. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

1 0 Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al.° refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I, et a!., Nature (1999) 402:489-495. 
Strand: Indicates ONA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand Ntj)osftion 



335809 Dunham, I. etai. Plus 26310772-26310909 

20 335824 Dunham, I. etal. Plus 26376860-26376942 

334223 Dunham, I. etal. Minus 12734365-12734269 

325544 6682452 Plus 171228-171286 
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 
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Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues. 
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such 
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or 
equal to 2.5. The "average" breast cancer level was set to the 90 th percentile value. The 
"average" normal adult tissue level was set to the 90* percentile value amongst 144 non- 
malignant tissues. In order to remove gene-specific background levels of non-specific 
hybridization, the 15 th percentile value amongst the 144 non-malignant tissues was subtracted 
from both the numerator and the denominator before the ratio was evaluated. 
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Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 



Pkey ExAccn UnigenelD UnigeneTHIe 



408591 
406964 
400291 
407277 
449746 
426878 
400292 
427585 
408045 
407178 
407377 
450705 
407212 
428848 
404561 
407980 
447350 
450375 
422109 
435496 
453160 
420813 
415989 
422505 
424399 
423575 
429441 
431474 
448595 
427217 
402578 
422805 
424634 
456207 
424086 
459587 



AF015224 

M21305 

AA401369 

AW170035 

A1668594 

BE069341 

AA250737 

D31152 

AW138959 

AA195651 

C16391 

U90304 

AA412108 

NMJ00230 

AA046309 

AI375572 

AAD09647 

S73265 

AW840171 

AI263307 

X51501 

AI267700 

AL120862 

A1905687 

C18863 

AJ224172 

AL133990 

AB014544 

AA399272 

AA436989 

NNL003613 

AA193450 

AI351010 

AA031956 



Hs.46452 

Hs.190721 
Hs.326736 
Hs.176588 

Hs.72472 
Hs.179729 
Hs.245123 
Hs.104106 

Hs.25351 

Hs.269350 

Hs.194236 



Hs.172634 

Hs.8850 

Hs.1473 

Hs.265398 

Hs.239884 

Hs.99949 

Hs.317584 

Hs,124165 

Hs.2533 

Hs.163443 

Hs.204096 

Hs.190642 

Hs.21572 

Hs.144341 

Hs.121017 
Hs.151407 



mammaglobin 1 

gb:Human alpha satellite and satellite 3 
ESTs 

Homo sapiens breast cancer antigen NY -BR 
ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
gb:QV3-BT0381-27010M73-c08 BT0381 Homo 
BMP-R1B 

collagen, type X, alpha 1 (Schmid metaph 
ESTs 
ESTs 

gb:C16391 Clontech human aorta poiyA mRN 
Iroquois homeobox protein 5 
ESTs 

leptin (murine obesity homolog) 
trichorhinophalangeal syndrome I (TRPS1) 
gb:zf12f01.s1 SoaresJetal_heartJylbHH19W 
ESTs 

a dlsintegrin and metalloprateinase doma 



ESTs, Weakly similar to transformation 
H2B histone family, member L 
prolacun-induced protein 
ESTs 

programmed cell death 9 (PDCD9) 
aldehyde dehydrogenase 9 family, member 
Homo sapiens cDNA FU11576 lis. done HE 
fipophflin B (uteroglobin family member) 
ESTs 

KIAA0644 gene product 
ESTs 

Cl001134:gq2117372|pirj[l65981 fatty ac 
H2A histone family, member A 
cartHage Intermediate layer protein, nu 
gfczr40e07.fi SoaresJJhHMPu_S1 Homosapi 



Hs.102267 lysyt oxidase 



gb:zkl5e04,s1 Soares _pregnanLuterus_NbH 



R1 

137.6 

71.0 

68.4 

54.2 

46.4 

44.8 

37.4 

323 

31.9 

30.4 

27.7 

24.8 

22.0 

219 

21.8 

19.8 

17,3 

16.6 

16.5 

16.0 

15.8 

15.8 

15.5 

14.8 

14.5 

13.7 

13.6 

13.5 

13.0 

12.8 

12.6 

12.2 

12.0 

11.9 

11.9 

11.5 
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416208 AW291168 Hs.41295 ESTs, Weakly similar to MUC2J-IUMAN MUCIN 1 1.5 

429170 NM.001394 Hs.2359 dual specificity phosphatase 4 11.5 

407276 AI951118 Hs.326738 Homo sapiens breast cancer antigen NY-BR 11.4 

434377 AW137148 Hs.306593 Homo sapiens cONA FU11382 fis, done HE 11.3 

5 448390 AL035414 Hs.21068 hypothetical protein 11.1 

452401 NM.007115 Hs.29352 tumor necrosis factor, alpha-induced pro 11.0 

421037 A1684808 Hs.197653 programmed cell death 9 (PDCD9) 10.9 

452461 N78223 Hs.108106 transcription factor 107 

443348 AW873596 Hs.182278 calmodulin 2 {phosphorytase kinase, delt 10.6 

10 421155 H87879 Hs.102267 lysyl oxidase 10.5 

402606 NM_024626:Homo sapiens hypothefica! prot 10.4 

409269 AA576953 Hs.22972 hypothetical protein RJ 13352 10.4 

447268 AI370413 Hs.36563 hypothetical protein aJ224 18 10.3 

447033 AI357412 Hs.157601 ESTs 10.2 

15 400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1 

424905 NMJW2497 Hs. 1 53704 NIMA (never in mitosis gene a)-relatedk 10.1 

432441 AW292425 Hs.163484 ESTs 9.9 

427365 AI873274 Hs.190721 ESTs 9.9 

438950 H23789 Hs.144530 EST 9.8 

20 422835 BE218705 Hs.121378 mefaDothionein-iike 5, tesfe-spectfic 9.7 

425692 D90041 Hs. 1 55956 N-acetyltransferase 1 (arylamine N-acety 97 

411869 W20027 Hs.23439 ESTs 9.6 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cON 9.6 

445730 AI624342 Hs.170042 ESTs 9.5 

25 459583 AI907673 gb:lL-BT1 52-080399-004 BT1 52 Homosapien 9.3 

452744 AI267652 Hs.30504 Homo sapiens mRNA; cONA DKFZp434E082 (fr 9.2 

432596 AJ224741 Hs.278461 matrilin 3 9.1 

400297 AI127076 Hs.334473 hypothetical protein DKFZp5640 1278 9.1 

449448 D60730 Hs.57471 ESTs 9.1 

30 423945 AA4 10943 gb:zt32h03.r1 Soares ovary tumor NbHOT H 9.1 

406348 Target Exon 9.0 

424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 9.0 

453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 9.0 

433365 AF026944 Hs.293797 ESTs 8.8 

35 405654 NA C1200l521:gi|7513934|pir|IT31081 cca3 pr 8.8 

418601 AA279490 Hs.86368 calmegin 8.8 

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 87 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.5 

409041 AB033025 Hs.50081 KIAA1 199 protein 8.4 

40 442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7 HUMAN A 8.3 

451561 N52812 Hs.177403 ESTs 8.2 

424001 W67883 Hs. 137476 paternally expressed 10 8.2 

429859 NM_O0705O Hs.225952 protein tyrosine phosphatase, receptor t 8.1 

423887 AL060207 Hs.1 34585 DKFZP434G232 protein 8.1 

45 405095 NA Target Exon 8.1 

419296 AA236115 Hs.120785 ESTs 8.0 

447164 AF026941 Hs.17518 Homo sapiens c'tg5 mRNA, partial sequence 8.0 

413472 BE242870 Hs.75379 solute carrier famiry 1 (gHal high affl 8.0 

416747 AW876523 Hs.15929 hypothetical protein RJ 129 10 8.0 

50 415385 R17798 Hs7535 C08W4ike protein 7.9 

434424 AI811202 Hs.325335 Homo sapiens cONA: FU23523 fis, clone L 7.9 

420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 7.9 

406687 M31126 Hs.272620 matrix metalloprotelnase 11 (MMP11;stro 7.8 

400285 NA Eos Control 77 

55 437207 T27503 Hs.15929 hypothetical protein RJ 129 10 7.6 

427119 AW880562 Hs.114574 ESTs 7.5 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 7.5 

433426 H69125 Hs.133525 ESTs 7.5 

411078 AI222020 Hs.182364 CocoaCrisp 7.4 

60 426214 H59846 Hs.128355 ESTs, Moderately simflar to ALU7_HUMAN A 7.4 

447475 AJ380797 Hs.158992 ESTs 7.3 

415263 AA948033 Hs.130853 ESTs 7.2 

439569 AW602166 Hs.222399 CEGP1 protein 72 

414142 AW368397 Hs.1 50042 Homo sapiens cONA FU14438 fis, done HE 7.1 

65 426261 AW242243 Hs.1 66670 peroxisomal famesylaled protein 7.0 

444783 AK001468 Hs.62180 anln (Drosophila Scraps homolog), act 6.9 

445885 AI734009 Hs. 1 27699 KIAA1 603 protein 6.9 
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429432 A1678059 Hs.202676 synaptonema! complex protein 2 6.9 
410781 AI375672 Hs.165028 ESTs 69 
443788 A1732643 Hs.144151 ESTs 69 
421373 AA808229 Hs.167771 ESTs 68 
451398 AI793124 Hs.144479 ESTs 6.8 
4 °4253 NJVL021058*-JHomo sapiens H2B histone fami 6.8 

441098 AI015591 Hs. 13 1004 ESTs, Weakly similar to T1 7227 hypothetl 6.7 
426215 AW963419 Hs.155223 stanniocaJcin 2 6 6 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 6,6 
422956 BE545072 Hs.122579 hypothetical protein FU 10461 6,6 
411111 AW818127 gb:CM1-ST0277-061299-059-«)7ST0277 Homo 6.6 

4349B8 AI418055 Hs.161160 ESTs 66 
442580 AI733682 Hs.130239 ESTs 66 
449611 AI970394 Hs.197075 ESTs 6.6 

408000 L11690 Hs.620 bullous pemphigoid antigen 1 (2307240kD) 6.5 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 6.5 

431089 BE041395 Hs.283676 ESTs, Weakly similar to unknown protein 6.5 

400301 X03635 Hs.1657 estrogen receptor 1 ' 65 

427356 AW023482 Hs.97849 ESTs 6.5 

425704 U79293 Hs.159264 Human done 23948 mRNA sequence 6.4 

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 6.4 

424902 NMJ03866 Hs. 153687 inositol polyphosphate-4-phosphatase.ty 6.4 

448693 AW004854 Hs.228320 hypothetical protein FU23537 6.4 

431448 AL1 37517 Hs.334473 hypothetical protein DKFZp5640l 278 6.2 

444342 NMJM398 Hs.10887 similar to tysosome-associated membrane 6.1 

422168 AA586894 Hs.1 12408 S1 00 <^um-binding protein A7 (psorias 6.1 

453331 AI240665 Hs.8895 ESTs 6.1 

418007 M13509 Hs.83169 mafrixmetalfoprotelnase 1 (M/WP1; inters 6.0 

441233 AA972965 Hs.135568 ESTs 60 

418092 R45154 Hs.106604 ESTs 60 

430044 AA464510 Hs.152812 ESTs 59 

432837 AA310693 Hs.87329 HSPC072 protein 59 

433285 AW975944 Hs.237396 ESTs 5.9 

450701 H39960 Hs.288467 Homo sapiens cDNA FU1 2280 fis. done MA 5.9 

425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 5.9 
410785 AW803341 gb:iaAJM0079-090300-050-D03 UM0079 Homo 5.9 

425398 AL049689 Hs.156369 hypothetical protein similar to tenascfo 5.9 

414812 X72755 Hs.77367 monokine induced by gamma interferon 5.8 

459371 R20991 gb:yg06hQ1.r1 Soares infant brain 1N1B H 5.8 

411284 N28519 Hs,135191 ESTs, Weakly similar to unnamed protein 5.8 

453511 AL031224 Hs.33102 transcription factor AP-2 beta (activati 5.8 

451807 W52854 Hs.27099 hypothetical protein FU23293 similar to 5.7 

430510 AW162916 Hs.241576 hypothetical protein PR02577 5.7 

415539 AI733881 Hs.72472 BMP-RIB 56 

438199 AW016531 Hs.122147 ESTs 5.6 

417866 AW067903 Hs.82772 collagen, type XI, alpha 1 5.5 

430019 AA463893 Hs.220933 ESTs 5 5 

439809 R41396 Hs.101774 hypothetical protein FU23045 5.5 

423811 AW299598 Hs.50895 homeoboxC4 5.4 

434539 AW748078 Hs.214410 ESTs, Weakly similar to MUC2JHUMAN MUCIN 5.4 

439138 AI742605 Hs.193696 ESTs 5 4 

453931 AL121278 Hs.25144 ESTs 54 

444078 BE246919 Hs.10290 U5 snRNP-spetific 40 kDa protein <hPrp8- 5,4 

447102 BE167434 Hs.98471 ESTs, Weakly similar to T1 8712 hypotheti 5.4 

451621 AI879148 Hs.26770 tatty add binding protein 7, brain 5.4 

425236 AW067800 Hs.155223 stannlocaJdn 2 5.3 

421464 AA291553 Hs.190086 ESTs 5.3 
450736 AW970060 gb:EST382140 MAGE resequences, MAGK Homo5.3 

428085 AA421081 Hs.12388 ESTs 5.3 

452838 1)65011 Hs.30743 preferentially expressed antigen In rneta 5.3 

445424 AB028945 Hs.12696 cortactin SH3 domain-binding protein 5.3 

456938 X52509 Hs.161640 tyrosine aminotransferase 5.3 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein {COM 5.2 

438167 R28363 Hs.24286 ESTs 52 

433330 AW207084 Hs.132816 hypothetical protein MGC 14801 52 

449765 N92293 Hs.206832 ESTs, Moderately similar to ALU8 HUMAN A 5.2 

416276 U41060 Hs.79136 UV-1 protein, estrogen regulated 52 
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400300 X03363 HER2 receptor tyrosine kinase (oerb-b2, 5.2 

418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GDNF family receptor alpha 1 5.2 

428771 AB028992 Hs.193143 KIAA1069 protein 5.2 
455047 AW852530 gb:PM1-CT0243-071099-001-g06 CT0243 Homo 5.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypolheti 5.2 

453197 AI916269 Hs. 109057 ESTs, Weakly similar to ALU5JWMAN ALUS 5.1 

400298 AA032279 Hs.61635 six transmembrane epithelial antfgen of 5.1 

431023 AI283133 Hs.297420 ESTs 5.1 

427666 AI791495 Hs.180142 calrnodulln-like skin protein 5.1 

427718 AI798680 Hs.25933 ESTs 5.1 

434531 AA642007 Hs.116369 ESTs 5.1 

429220 AW207206 Hs.136319 ESTs 5.1 

405494 NA C2001837*:gi|1 2697903|dbj|BAB21 770.1 1 (A 5.1 

452930 AW195285 Hs.194097 ESTs, Weakly similar to 138022 hypotheti 5.1 

444910 AI201849 gb:qs76g04.x1 NCI_CGAP.Pr28 Homo sapiens 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.0 

444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 5.0 

450603 R43646 Hs.12422 ESTs 5.0 

416575 W02414 Hs.38383 ESTs 5.0 

43B504 AW665281 Hs.224625 ESTs 5.0 

416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient yeast h 5.0 

428804 AK000713 Hs.193736 hypothetical protein FU20706 5.0 

420077 AW512260 Hs.87767 ESTs 4.9 

450480 X82125 Hs.25040 zinc finger protein 239 4.9 

437637 AJ003029 Hs.65792 syntrophin, gamma 2 4.9 

431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 4.9 

418836 AI655499 Hs.161712 ESTs 4.8 

442441 AI820662 Hs.1 29598 ESTs 4.8 

435635 AF220050 Hs.1 81385 uncharacterized hematopoietic stem/proge 4.8 

400286 NA C16000922:gi|7499103lpiri|T20903 hypothe 4.8 

407506 U71600 gb:Human zinc finger protein zfp3l (zf31 4.8 

420026 AI831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs.202656 ESTs 4.8 

457726 AI217477 Hs.194591 ESTs 4.8 

412785 AW997556 Hs.78521 KIAA171 7 protein 4.8 

428368 BE440042 Hs.83326 matrix metanoproteinase 3 {stromelysin 4.7 

436026. AI349764 Hs.217081 ESTs 4.7 

409110 AA191493 Hs.48778 niban protein 4.7 

400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs.279727 Homo sapiens cONAFU 14035 fis, clone HE 4.7 

407819 R42185 Hs.274803 ESTs 4.7 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.7 

422896 AW961489 Hs.154116 ESTs 4.7 

453616 NMJXJ3462 Hs.33846 dyneln, axonemal light intermediate pol 47 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 4.6 

454074 R63503 Hs.28419 ESTs 4.6 

405718 C4000799 # :gi|6330365|dbflBAA86508.1| (AB 4.6 

444649 AW207523 Hs.197628 ESTs 4.6 

429431 240313 Hs. 106330 Homo sapiens done IMAGE .23371 , mRNAseq 4.6 

427611 M81057 Hs. 1 80884 carboxypeptldase B1 (tissue) 4.6 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 4.6 

430345 AK000282 Hs.239681 hypothetical protein aJ20275 4.6 
454307 AW855717 gb^C1-CT0279-081299-O134)01 CT0279 Homo 4.6 

400303 AA242758 Hs.79136 UV-1 protein, estrogen regulated 4.6 

438180 AA808189 Hs.272151 ESTs 4.6 
451340 AW936273 gb:QV(W)T0020-O90200-107-g07OT0020 Homo 4.6 

458711 AL036877 Hs.282878 ESTs 4.6 

457430 AA514660 Hs.128443 ESTs 4.6 

416030 H15261 Hs.21948 ESTs 4.6 

447233 AW246333 Hs.17901 Homo sapiens, clone IMAGE:39370 15, mRNA. 4.6 

445537 AJ245671 Hs.12844 EGF-like-domain. multiple 6 (EGFL6) 4.5 

424590 AW966399 Hs.46821 hypothetical protein FU 20086 4.5 

432374 W68815 Hs.301 885 Homo sapiens cDNA FU1 1346 fis, clone PL 4.5 

423833 AW503329 gb:UI-HF-BN0-akx-e-02-0-Ul.r1 NIH J/IGC.50 4.5 

406747 AI925153 Hs.217493 annextnA2 4.5 

286 



WO 02/059377 



PCT/USO2/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



412102 
431716 
411050 
401418 
436194 
436211 
414080 
424115 
415786 
442117 
452784 
432731 
410534 
405196 
430217 
401793 
415747 
423679 
400238 
425627 
400608 
458634 
407771 
405906 
405925 
439382 
445263 
407162 
419536 
454359 
411558 
450715 
421451 
452864 
409757 
413043 
413499 
444619 
408380 
406992 
404285 
425247 
428046 
446163 
421147 
426451 
415227 
452176 
452862 
443646 
425523 
424687 
430009 
434469 
451381 
450229 
455700 
431924 
438885 
401451 
431676 
409092 
429270 
443903 
427122 
410275 
432912 



H56435 
D89053 
AW814902 
NA 

AK001074 

AK001581 

AA135257 

AA335497 

AW419196 

AW664964 

BE463857 

R31178 

AW905138 

NA 

N47863 

AA381209 
AB007975 
NA 

AF019612 

AV657310 
AL138272 
NA 
NA 

BE247684 

H57646 

N63855 

AA603305 

N71277 

AA102670 

A1266484 

AA291377 

AA033714 

NM.001898 

BE158766 

BE144884 

BE538082 

AF123050 

S82472 

NA 

NN1005940 

AW812795 

AA026880 

AW592167 

AI908165 

AW821113 

AA024538 

AW378065 

AK385198 

AB007948 

J05070 

AA894564 

AA634806 

BE241831 

R18717 

8E068115 

AKOOOB50 

AI886558 

A1685464 

AI735283 

W60379 

AC20547 

AW057736 

U85658 

BE007371 



gb:yq98e09.rt Soares fetal Over spleen 4.5 

Hs.268012 fatty-acid-Coanzyme A Ggase, long-chain 4.5 
gb:MR1-ST020oM204G0-022-f03ST0205Homo 4.5 

C140O0338*:gij74595O2!p!r|lS74665 outer 4.5 

Hs.333435 Homo sapiens cONA FU10212 fe, clone HE 4.5 

Hs.334828 hypothetical protehFU 10719; KIAA1794 4.4 

Hs.47783 B aggressive lymphoma gene 4.4 

Hs.293965 ESTs, Weakly similar to I38022 hypothefi 4.4 

Hs^57924 hypothetical protein FU 13782 4.4 

Hs.128899 ESTs 4.4 

Hs.151258 hypothetical protein FU21062 4.4 

Hs.287820 fibronectin 1 4.4 
gb:QVO-NN1071-280400-207-g07 NN1071 Homo4.4 

C2000662*:gi]7512792lpir||T124B2 hypothe 4.4 

Hs.336901 ribosomal protein S24 4.4 

C17001545:gi|5360127]gb|AAD42882.1|AF155 4.4 

gb:EST94257 Activated T-celb I Homo sap 4.4 

Hs.131454 KIAA0506 protein 4.4 

C19000274*:gij12741327Jref)XP - 008833.2j 4.4 

Hs.297007 membrane-bound transcription factor prot 4.4 

C10001899giI7508633|pir||T25392 hypothe 4.4 

Hs282m ESTs 4.3 

Hs.62713 ESTs 4.3 

Taiget Exon 4.3 

Target Exon 4.3 

Hs.103070 ESTs 43 

Hs.42586 WAA1560 protein 4.3 

Hs.142634 zinc finger protein 4.3 

gb:np1 2d1 1 .s1 NCLCGAP_Pr3 Homo sapiens 4.3 

gbza36e03.s1 Soares fetal liver spleen 4.3 

Hs.70725 gamma-aminobutyricacid(GA8A)Arecepto 4.2 

Hs.31570 ESTs, Weakly similar to K1AA1324 protein 4.2 

Hs.50831 ESTs 4.2 

Hs.287629 hypothetical protein FU1 4260 A2 

Hs.123114 cystatinSN 4.2 

gb:IL2-HT0397471299-024-F02HT0397Homo 42 
gb:CMO-HT0182-041099-065-e11 HT0182 Homo4.2 

Hs.8172 ESTs, Moderately similar to A46010 X-lin 4.2 

Hs.44532 diubiquitin 4.2 

gb:beta -pol=DNA polymerase beta {exon a 4.2 

C6001909;giI704441 jdbJ}QAA18909.1| (D298 4.2 

Hs.155324 matrix metalloproteinase 11 (MMP11;stro 4.2 

Hs.155381 ESTs. Moderately similar to I38022 hypot 42 

Hs.25252 prolactin receptor 4.2 

Hs.293299 ESTs 4.2 

Hs.169946 GATA-binding protein 3 (T-ceil receptor 4.2 

Hs.72402 ESTs 42 

Hs.282990 Human DNA sequence from clone RP1-28H20 4.2 

Hs.8687 ESTs 4.2 

Hs.164226 ESTs 4.2 

Hs.158244 KIAA0479 protein 4.1 

Hs.151738 matrix metalloproteinase 9 (gelatinaseB 4.1 

Hs.22242 ESTs 4.1 

gbab28c02.r1 Stratagene lung (937210) H 4.1 

Hs.172330 hypothetical protein MGC2705 4.1 

Hs.8929 hypothetical protein FU11362 4.1 
gb:CM1-BT0368-061299-06(^g07 BT0368 Homo4.1 

Hs.272203 Homo sapiens cDNA FU20843 fis, clone AD 4.1 

Hs.184987 ESTs 4.1 

NM_004496*:Homo sapiens hepatocyte nude 4.1 

gb.H88f04.x1 NCI CGAP_Pr28 Homo sapiens 4.1 

Hs.172608 ESTs 4.1 

Hs.57773 ESTs 4,1 

Hs.135223 ESTs - 4.1 

Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 4.1 

Hs.61796 transcription factor AP-2 gamma (activat 4.1 

Hs.200313 ESTs ' 4.1 
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403585 Target Exon 4.1 

438295 A1394151 Hs.37932 ESTs 4.1 

420380 AA640891 Hs.102406 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic anhydrase VIII 4.1 

416182 NM_004354 Hs.79069 cydinG2 4.1 

418994 AA296520 Hs.89546 selectin E (endolhefial adhesion molecul 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs.58589 glycogenin2 4.0 

427674 NNL003528 Hs.2178 H29 histone family, member Q 4.0 

427131 AA448460 Hs.112017 GE36gene 4.0 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length Insert cON 4.0 

429353 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 4.0 

421296 NrVL002666 Hs.103253 perilipin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

424188 - AW954552 Hs.142634 zinc finger protein 4.0 
455431 AW938484 gb^M0-OT0057-290200-253-<106 DT0057 Homo4.0 

404142 NA Target Exon 4.0 

441143 AI027604 Hs.159650 ESTs 4.0 

444540 AI693927 Hs.265165 ESTs 4.0 

415579 AA165232 Hs.222069 ESTs 4,0 

452891 N75582 Hs.212875 ESTs, Weakly similar to DYH9_HUMAN CIUA 4.0 
414605 BE390440 gb:601283601F1 NIH_MGC_44 Homo sapiens c 4.0 

452281 T93500 Hs.28792 Homo sapiens cDNA FLJ 11041 fls, clone PL 4.0 

417801 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like rep 4.0 

446232 AI281848 Hs.194691 retinolc acid Induced 3 4.0 

447377 X77343 Hs.334334 transcription factor AP-2 alpha (activat 4.0 

437854 AL119723 gb:OKFZp761A2124_r1 761 (synonym: hamy2) 4.0 

446140 AA356170 Hs.26750 hypothetical protein FU21 908 4.0 

452240 AI591147 Hs.61232 ESTs 4.0 

459574 AI741122 Hs. 101 810 Homo sapiens cDNAFU 14232 fis, clone NT 4.0 

458673 N99626 gb:za39d1U1 Soares fetal Cver spleen 4.0 

444858 AI199738 Hs.208275 ESTs, Weakly similar to ALUAJHUMAN llll 4.0 

452166 AI948607 Hs.264680 ESTs 4.0 

452681 AF153330 Hs.30246 solute carrier famfly 19 (thiamine trans 3.9 

450192 AA263143 Hs.24596 RAD51 -interacting protein 3.9 

406554 NA Ta/getExon 3.9 

416259 AA573006 Hs.19173 ESTs 3.9 

445813 Z42023 Hs.106576 alanine-gryoxylate aminotransferase 2-D 3.9 

451024 AA442176 gb:zw63b08.r1 SoaresJatalJetus__Nb2HF8_ 3.9 

413930 M86153 Hs.75618 RAB11A, member RAS oncogene family 3.9 

401781 Target Exon 3.9 

415296 F05086 Hs.328142 ESTs 3.9 

452564 AA026777 gb:ze93c11.r1 Soares JetalhearLNbHH19W 3.9 

442500 AI819068 Hs.209122 ESTs 3.9 

419759 Z21336 Hs.135411 actin related protein 3.9 

424638 AI472106 Hs.49303 Homo sapiens cDNA FLJ1 1663 fis, clone HE 3.9 

439699 AF086534 Hs.187561 ESTs, Moderately similar to ALU1_HUMAN A 3.9 

428042 AA419529 Hs.76391 myxovirus (influenza) resistance 1, homo 3.9 

452501 AB037791 Hs.29716 hypothetical protein FU10980 3.9 

453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sapiens cDNA: FU22097 fis, clone H 3.9 

443489 AI073512 Hs.133916 ESTs 3.9 
455092 BE152428 gb:CMQ-HT0323-1 51 299-1 26-b04 HT0323 Homo3.9 

401 785 NM_002275*:Homo sapiens keratin 1 5 (KRT1 3.9 

426427 M86699 Hs.1 69840 TTK protein kinase 3.9 

446009 AI989885 Hs.231926 ESTs 3.9 

436033 H75391 Hs.255748 ESTs 3.9 
451067 BE172186 gb:MR0-HT0559-110300-005-h1 1 HT0559 Homo3.8 

419348 AA236645 Hs.98274 ESTs 3.8 

444635 AI184268 Hs.339665 ESTs 3.8 

412140 AA219691 Hs.73625 RAB6 interacting, WnesirMike (rabkjnes 3.8 

403593 NA Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 Hs.87836 Homo sapiens PAC done RP5-1087M19 from 3.8 

433871 W02410 Hs.205555 ESTs 3.8 

445253 AI217928 Hs.144762 ESTs 3.8 

409542 AA503020 Hs.36563 hypothetical protein FU2241 8 3.8 
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443162 T49951 Hs.9029 DKFZP434G032 protein 3.8 

458194 AW383618 Hs.265459 ESTs, Moderately similar to ALU2_HUMAN A 3.8 

422475 AL359938 Hs.1 17313 Meis (mouse) homotog 3 3.8 

440705 AA904244 Hs.153205 ESTs 3.8 

447290 AI476732 Hs.263912 ESTs 3.8 

403426 Target Exon 3.8 

427821 AA470158 Hs.98202 ESTs 3.8 

454288 BE222648 Hs.279458 ESTs, Highly similar to c33QA1.1b (H.sap 3.8 

443801 AW206942 Hs.253594 ESTs 3.8 

410658 AW105231 Hs.192035 ESTs 3.8 
410672 AW794600 gfcRC64JMM14-17030O-O22-C05 UM0014 Homo3.8 

428579 NM_005756 Hs.184942 6 protein-coupled receptor 64 3.8 

445495 BE622641 Hs.38489 ESTs, Weakly similar to I38022 hypotheti 3.8 

447995 AI742618 Hs.181733 ESTs, Weakly similar to nitrilase homolo 3.7 

401747 Homo sapiens keratin 17 (KRT17) 3.7 

420633 NM_014581 Hs.274480 odoranUiinding protein 2A 3.7 

423545 APOO0692 Hs.1 29781 chromosome 21 open reading frame 5 3.7 

' 433138 AB029496 Hs.59729 semaphorin sem2 3.7 

434715 BE005346 Hs.116410 ESTs 3.7 

428664 AK001666 Hs.189095 simBartoSAU.1 (sal (DrosoprulaHike 3.7 

450951 AA018534 Hs.103334 ESTs 3.7 
402696 NA C3CK)2523:gI|6686211lsp|Q27533|YH2M_CAEEL3.7 

446868 AV660737 Hs.135100 ESTs 3.7 

458154 AW816379 Hs.335018 ESTs . 3.7 

422026 U80736 Hs.1 10826 trinucleotide repeat containing 9 3.7 

419440 AB020689 Hs.90419 KIAA0882 protein 3.7 

421524 AA312082 Hs.105445 GDNF family receptor alpha 1 3.7 

417283 N62840 Hs,48648 ESTs 3.7 

401508 NA NrVL024817:Homo sapiens hypothetical prat 3.7 

410303 AA324597 Hs.21851 Homo sapiens cONA FU12900 fis, done NT 3.7 

420362 U79734 Hs.97206 huntingtin interacting protein 1 3.7 

433384 AI021992 Hs.124244 ESTs 3.7 

434302 AA629065 Hs.1 16301 ESTs 3.7 

443938 R55373 Hs.20864 ESTs 3.7 
448420 BE623004 gb:601441282F1 NIH_MGC_72 Homo sapiens c 3.7 

458712 AI347502 Hs,107872 hypothetical protein FU20761 3.7 

433404 T32982 Hs.102720 ESTs 3.7 

405232 NMJ)15832:Homo sapiens methyl-CpG bindin 3.7 

430491 AL109791 Hs.241559 Homo sapiens mRNA full length insert cDN 3.7 
455609 BE011668 gb:CM3-BK0223-100500-177-a04BN0223Homo3.7 

450164 AI239923 Hs.30098 ESTs 3.7 

453948 AI970797 Hs.64859 ESTs 3.7 

436061 AI248584 Hs.190745 Homo sapiens cONA: FU21326 fis, clone C 3.7 

401049 NA Target Exon 3.6 

418867 031771 Hs.89404 msh (Drosophfla) homeo box homotog 2 3.6 

420179 N74530 Hs.21168 ESTs 3.6 

458663 AV658444 Hs.280776 tankyrase, TRFI-interading ankyrin-rela 3.6 

437259 A1377755 Hs.120695 ESTs 3.6 

428309 M97815 Hs. 183650 cellular retinoic acid-binding protein 2 3.6 
450522 A1698839 gb:wd31K)2jc1 Soares_.NFUJ_GBC_S1 Homo s 3.6 

451952 AL120173 Hs.301663 ESTs 3.6 
412209 AW901456 gb:RC0-NN 101 2-270300-031 -c07 NN1012 Homo3.6 

425201 AA352111 gb£ST6O061 Activated T-ceb XX Homo sa 3.6 

443830 AI142095 Hs.143273 ESTs 3.6 
439255 BE164500 gb:RC4-HT0469-23030M14-e10 HT0469 Homo 3.6 

414869 AA157291 Hs.21479 ubinudehl 3.6 

409064 AA062954 Hs. 141 883 ESTs 3.6 

407721 Y12735 Hs.38018 dual-specificity tyrosine^)-phosphoryl 3.6 

445135 AK000054 Hs.12347 hypothetical protein FU20047 3.6 

404091 NA Target Exon 3.6 

409731 AA125985 Hs.56145 thymosin, beta, identified in neuroblast 3.6 

405153 Target Exon 3.6 

423248 AA380177 Hs.125845 ribulose-5-phosphate-3-epimerase 3.6 

403639 NA ENSP00000233023*:CONA aJ1 2662 fis, clon 3.6 

404360 C7001385:gf|12082809|gb|AAG48618.1|AF315 3.6 

422352 AA766296 Hs.99200 ESTs 3.6 

423338 AB007961 Hs.127338 K1AA0492 protein 3.6 

289 



WO 02/059377 



424202 BE350295 Hs.15032 RAN binding protein 17 3.6 

431750 AA514986 Hs.283705 ESTs 3.6 

439907 AA853978 Hs.124577 ESTs 3.6 

453596 AA441838 Hs.62905 hypothetical protein RJ 14834 3.6 

406446 NA Target Exon 3.6 

418454 AA315308 Hs.195870 hypothetical protein FU14991 3.6 

434360 AW015415 Hs.127780 ESTs 3.6 

409079 W87707 Hs.82065 interieukin 6 signal transducer (gp130, 3.6 

440132 AI697121 Hs.202466 ESTs, Weakly similar to S65824 reverse t 3.6 

448706 AW291095 Hs.21814 Interieukin 20 receptor, alpha 3.6 

440671 AW297920 Hs.130054 ESTs 3.5 
407647 AW860158 gb:RCO-CT0379-29010M32-b04 CT0379 Homo 3.5 

459023 AW96B226 Hs.60798 ESTs 3.5 

402820 NM 017646*:Homo sapiens tRNA isopentenyl 3.5 

417009 AA191719 Hs.314714 ESfs 3.5 

447048 AW393080 Hs.228320 hypothetical protein FU23537 3.5 

449978 A1806335 Hs.200829 ESTs, Weakly similar to T30171 ninein - 3.5 

428062 AA420683 Hs.98321 hypothetical protein FU14103 3.5 

452909 NrVL015368 Hs.30985 pannexin 1 3.5 

400610 NA Target Exon 3.5 

417843 W07361 Hs.22545 Homo sapiens cDNA RJ12935 fis, clone NT 3.5 

419335 AW960146 Hs.284137 hypothetical rwtein FU 12888 3.5 

451592 AI805416 Hs.21'3897 ESTs 3.5 

443270 NMJJ04272 Hs.337737 Homer, neuronalimmediate early gene, 1B 3.5 

423948 AW392342 Hs.283077 centrosomal P4.1 associated protein; unc 3.5 

449424 AW448937 Hs.197030 ESTs 3.5 
423841 AW753967 gb:RC2-CT0304^)8010(W)114i12CT0304Homo3.5 

416806 NMJ00288 Hs.79993 peroxisomal biogenesis factor 7 3.5 

422060 R20893 Hs.325823 ESTs, Moderately similar to AUJ5..HUMAN A 3.5 

433104 AL043002 Hs.128246 ESTs, Moderately similar to unnamed prot 3.5 

415778 H84847 Hs.49391 hypothetical protein LOC541 49 3.5 

413054 AW316843 Hs.66309 hypothetical protein MGC1 1061 3.5 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxyTrc 3.5 ■ 

424639 AI917494 Hs.9812 Homo sapiens cDNA FU14388 fis, done HE 3.5 

424827 AI057094 Hs.96867 Homo sapiens cDNA: FU23 155 fis, clone L 3.5 - 

437782 A1370876 Hs.79090 exportin 1 (CRM1, yeast, homotog) 3.5 

411514 AW850178 gb:IL3<;T021 9-27 1099-022-H 12 CT0219 Homo 3.5 

413783 AA314337 Hs.301547 ribosomal protein S7 3.5 

421106 AA877124 Hs.172844 ESTs 3.5 

431291 N25521 Hs.25275 Kruppel-type zinc finger protein 3.5 

440623 AI935016 Hs.216639 ESTs 3.5 
455838 BE145808 gb:MR0-HT0208-1 01 299-1 03-fl 1 HT0208 Homo 3.5 

458771 AW295151 Hs.163612 ESTs 3.5 

442942 AW167087 Hs.131562 ESTs 3.5 

436550 Z50158 Hs.270235 ESTs, Weakly similar to MMHUB1 larrrinln b 3.5 

418849 AW474547 Hs.53565 Homo sapiens P1G-M mRNA for mannosyltran 3.5 

424420 BE614743 Hs.146688 prostaglandin E synthase 3.5 

430916 AW505021 Hs.88414 BTB and CNC homology 1, basic leucine zi 3.5 

432030 AI908400 Hs.143789 ESTs 3.5 

439405 AF086224 Hs.55238 ESTs 3.5 

405917 NA C17000675:gij7290703lgbJAAF46150.1|(AEO 3.5 

452727 AW993582 Hs.176220 ESTs 3.5 

426320 W47595 Hs.169300 transforming growth factor, beta 2 3.4 

421070 AA283185 Hs.19327 ESTs 3.4 

424625 AW904466 Hs.321197 PDZ domain protein (DrosophBa inaWike 3.4 

428508 BE252383 Hs.184668 SBB131 protein 3.4 
455651 BE064962 gb:RC1-BT031 3-1 30400-0 16-C02 8T0313 Homo 3.4 

410555 U92649 Hs.64311 a dislntegrin and melalloproteinase doma 3.4 

447754 AW073310 Hs.163533 Homo sapiens cDNA FU14142 fis, clone MA 3.4 
418636 AW749855 gb:QV4-BT0534-281299453<Q5 BT0534 Homo 3.4 

404097 NA C5000242 # :gi|9369379lgblAAF87128.1JAC006 3.4 

434205 AF119861 Hs.283032 hypothetical protein PRO201 5 3.4 

421072 AI215069 Us.89113 ESTs 3.4 

402421 NA C100l578*:gij6759903lgb|AAF28099.1}(AF1 3.4 

405248 NA Target Exon 3.4 

407638 AJ404672 Hs.334483 hypothetical protein FU23571 3.4 

403000 BE247275 Hs.151787 U5 snRNP-spedffc protein, 116 kD 3.4 
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433393 AF038564 Hs.98074 itchy (mouse homolog) E3 ubiquitin prote 3.4 

432239 X81334 Hs.2936 matrix metatloproteinase 13 (ccliagenase 3.4 

458747 BE618395 Hs.25739l hypothetical protein DKFZp761J1 523 3.4 

442082 R41823 Hs.7413 ESTs; calsyntenin-2 3.4 
417974 AA210765 gb2J90cO6.r1 NCLCGAPJ3CB1 Homo sapiens 3.4 

446002 A1346468 Hs.145789 ESTs 3.4 

448995 A1613276 Hs.5662 guanine nucleotide binding protein (G pr 3.4 

438007 A1247718 Hs.232168 ESTs 3.4 

424698 AA164366 Hs.151973 hypothetical protein RJ23511 3.4 

435202 AI971313 Hs.170204 K1AA0551 protein 3.4 

410467 AF102546 Hs.63931 dachshund (DrosophHa) homolog 33 

405460 NA Target Exon 3.3 

441826 AW503603 Hs.129915 phosphotriesterase related 3.3 

453472 AL037925 gb:DKFZp564M037_rl 564 (synonym: hfbr2) 3.3 

447078 AW885727 Hs.301570 ESTs 33 

441690 R81733 Hs.33106 ESTs 33 

420092 AA814043 Hs.88045 ESTs 3.3 

418478 U38945 Hs.1174 cydirHiependent kinase inhfoitor 2A {me 33 

408908 BE296227 Hs.250822 serine/threonine kinase 15 3.3 

414737 AI160386 Hs.125087 ESTs 33 

449650 AF055575 Hs.23838 calcium channel, voltage-dependent L ty 33 

418912 NNL000685 Hs.89472 angiotensin receptor 1 33 

436405 AA160079 Hs.172932 Homo sapiens mRNA for partial yiJTR, seq 3.3 

453911 AW503857 Hs.4007 Sarcolemmat-assoclated protein 3.3 

409361 NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo 3.3 

429548 AW138872 Hs.135288 ESTs 3.3 

420807 AA280627 Hs.57846 ESTs 33 

409695 AA296961 gb:EST1 12514 Adrenal gland tumor Homo sa 33 

445189 AI936450 Hs.147482 ESTs 3.3 

402892 NA Target Exon 3.3 

426681 AA994896 Hs.22514 ESTs 33 

458722 AA741545 Hs.282832 ESTs, Weakly similar to T24961 hypotheti 3.3 

409430 R21945 Hs. 166975 splicing factor, arginine/serine-rich 5 3.3 

443194 AI954968 Hs.279009 matrix Gla protein 3.3 
445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA done 33 

410908 AA121666 Hs.10592 ESTs 33 

406151 NA Target Exon 33 

436461 AW511956 Hs.293261 ESTs 33 
411171 AW820260 gb:QV2-ST0296-150200-040-c10ST0296 Homo 33 

432415 T16971 Hs^89014 ESTs, Weakly similar to A43932 mucin 2 p 3.3 

439310 AF086120 Hs.102793 ESTs 33 

401575 NA Target Exon 3.3 

420900 AL045633 Hs.44269 ESTs 3.3 

445628 AI344166 Hs.155743 ESTs 3.3 

448243 AW369771 Hs.52620 integrin, beta 8 33 

445102 AW204610 Hs.22270 ESTs 3.3 

442118 AA976718 Hs.202242 ESTs 33 

416783 AA206186 HsJ9889 monocyte to macrophage differentiation-a 33 

435039 AW043921 Hs.130526 ESTs 3.3 

451474 T70874 Hs.207636 ESTs 3.2 

442559 T10213 Hs.159993 gycosyltransferase 3.2 

453921 A1824009 Hs.44577 ESTs 3,2 

420036 R60336 Hs.52792 Homo sapiens mRNA; cDNA DKFZp586H 823 (f 3.2 

435627 W88774 Hs.1 18370 ESTs 3.2 

411598 BE336654 Hs.70937 H3 histone family, member A 3.2 

446733 AA863360 Hs.26040 ESTs, Weakly similar to fatty acid omega 3.2 

410153 BE311926 Hs.15830 hypothetical protein FU 12691 3.2 

403637 NA C3001106*:gi|10047201|dbj|BAB13394.1j{A 3.2 

405547 NM_018833*:Homo sapiens transporter 2, A 3.2 

427878 C05766 Hs.181022 CG1-07 protein 3.2 

451871 AI821005 Hs.1 18599 ESTs 3.2 

410313 R10305 Hs.185683 ESTs 3.2 

416856 N27833 Hs^69028 ESTs, Weakly similar to I38022 hypotheti 3.2 

449490 AI652777 Hs.197069 ESTs 3.2 

450506 NMJ304460 Hs.418 fibroblast activation protein, alpha 3.2 

440684 AI253123 Hs.127356 ESTs, Highly similar to S21 424 nestin (H 3.2 

459055 N23235 Hs.30567 ESTs, Weakly similar to B34087 hypotheti 32 
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452190 H26735 
430965 AA489732 
405394 

424693 BE169810 
454265 H03556 
437687 AA765917 
428372 AK000684 
414083 AL121282 
411670 AW856552 
416283 NNL0Q5429 
437488 AA758239 
428398 AI249368 
452042 H38857 
421477 AI904743 
438078 AI016377 
448816 AB033052 
419519 AJ198719 
404580 

447046 AA326187 
457473 AW974903 
429838 AW904907 
459702 AI204995 
400195 NA 
417860 AW408557 
417995 AW974175 
422589 AA312735 
435870 AA701327 
440801 AA906366 
426274 038122 
423728 AW891294 
439677 R82331 
452834 AJ638627 
431349 AA503653 
417576 AA339449 
430264 AA470519 
418827 BE327311 
410835 AW806906 
426269 H15302 
405336 NA 
437783 AI683150 
440931 AI583052 
455945 BE160636 
430437 AI768801 
405848 NA 
455685 BE066976 
406970 M29994 
409602 W26713 
423518 045027 
425653 AI065104 
426326 BE165753 
433805 AA706910 
437152 AL050027 
448602 AJ541305 
452844 AW407181 
407366 AF026942 
408254 AW807227 
424085 NM_002914 
416790 R83066 
420020 BE295866 
426119 W94997 
426968 U07616 
457421 AL117431 
453403 BE466639 
454141 AW138413 
426650 AA382814 
450865 AI245013 
407993 AW135274 



Hs.9 1 668 Homo sapiens done PP1498 unknown mRNA 3.2 

Hs.154918 ESTs 3.2 

Target Exon 3.2 

Hs.47557 ESTs 3.2 

Hs.300949 ESTs, Weakly similar to thyroid hormone 3.2 

Hs 122840 ESTs 3.2 

Hs!l 63887 hypothetical protein FU22104 3.2 

Hs.257786 ESTs 3.2 
gb:RC1-CT0294-G80100-012-a04 CT0294 Homo3.2 

Hs.79141 vascular endothelial growth factor C 3.2 

Hs.180330 ESTs 3.2 

Hs.98558 ESTs 3.2 

Hs.243901 Homo sapiens cDNA FU20738 fis, clone HE 3.2 

Hs.104650 hypothetical protein HJ10292 3.2 

Hs.131693 ESTs 3.2 

Hs.22151 KIAA1226 protein 3,2 

Hs.176376 ESTs 3.2 

NM_0141 12*:Homo sapiens trichorhinophala 3.2 

Hs.1 71 70 G protein-coupled receptor 4 3,2 

Hs.291231 ESTs 3.1 

Hs.30732 hypothetical protein RJ 13409; WAA1711 3.1 

gb;an03c03jc1 Stratagene schizo brain S 1 3. 1 

NfVL007057*:Homo sapiens ZW10 interactor 3.1 

hypothetical protein FU14075 3.1 



Hs.235498 
Hs.188751 
Hs.30512 
Hs.17949 
Hs.190535 
Hs.2007 
Hs.132136 
Hs.164599 
Hs.105685 
Hs.156942 
Hs.82285 

Hs.47166 

Hs.168950 

Hs.201550 
Hs.270058 

Hs.169943 



Hs.256972 
Hs.129732 
Hs.249718 
Hs.250528 
Hs.112742 

H$M778 
Hs.218377 



Hs.139226 

Hs,7043 

Hs.94382 

Hs.189917 

Hs.173034 

Hs.112165 

Hs.61779 

Hs.139336 

Hs.106532 
Hs.12433 



ESTs, Weakly similar to MAPB_HUMAN MICRO 3.1 
Homo sapiens mRNA for KIAA0556 protein, 3.1 
ESTs 3.1 
ESTs 3.1 
tumor necrosis factor (ligand) superfami 3.1 
solute carrier family 4, sodium bfcarbon 3.1 
ESTs 3.1 
KIAA1688 protein 3.1 
ESTs, Moderately similar to ALU2 JWMAN A 3.1 
phosphonbosylglycinamideformyltransfer 3.1 
gb:nc7H10.s1 NCI_CGAP_Pr1 Homo sapiens 3.1 
HT021 3.1 
gb:QV4-ST0023-160400-172-d12 ST0023 Homo 3.1 
Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.1 
Target Exon 3.1 
ESTs, Weakly similar to ALU1JUJMAN ALU S 3.1 
ESTs 3.1 
gb:PM1-HT0422-291299-002<08 HT0422 Homo 3.1 
Homo sapiens cDNA FU 1 3569 fis, clone PL 3.1 
Target Exon 3.1 
gb:PM0-8T0340-211299-003-c12 BT0340 Homo 3.1 
gb:Human alphas spectrin gene, exon 12. 3.1 
ESTs 3.1 
R3H domain (binds sing le-stranded nuclei 3.1 
ESTs, Weakly similar to A46010 X-iinked 3.1 
Homo sapiens, clone 1MAGE:4098694, mRNA, 3.1 
ESTs 3.1 
gb:Homo sapiens mRNA; cDNA OKFZp566C03243.1 
niban protein 3.1 
Homo sapiens cDNA HJ 1 1927 fis, clone HE 3.1 
gb:Homo sapiens cig33 mRNA, partial sequ 3.1 
gb:MR4-ST0062-180200-001-e10ST0062Homo3.1 



replication factor C (activator 1) 2 (40 3.1 

succinate-CoA figase, GDP4ormlng, alpha 3.1 

adenosine kinase 3.1 

ESTs 3.1 

amphiphysin (Stiff-Mann syndrome with br 3.1 

Homo sapiens CDNAFU12198 fis, ctone MA 3.1 

Homo sapiens cDNA FU13591 fis, done PL 3.1 

ATP-binding cassette, sub-family C (CFTR 3.1 

gb:EST96097 Testis I Homo sapiens cDNA 5 3.1 

ESTs, Weakly similar to 138588 reverse t 3.1 

ESTs 3.1 
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446466 


H38026 


457888 


BE219794 


420058 


AK001423 


409248 


AB033035 


452747 


BE153855 


418926 


AA232658 


419346 


AI830417 


429826 


N93266 


435147 


AL133731 


420139 


NM 005357 


405609 


NA 


404274 




449777 


AI971362 


415459 


H07118 


415245 


N59650 


406291 


NA 


414210 


BE383592 


432055 


AW972359 


442246 


A1791988 


451353 


N21043 


451177 


AI969716 


418026 


BE379727 


401326 


NA 


409920 


BE169746 


432887 


A1926047 


411789 


AF245505 


401045 




432162 


AA584062 


434627 


AI221894 


442611 


BE077155 


425477 


AW958879 


457183 


H91882 


433014 


NMJM711 


415542 


R13474 


416173 


R52782 


408155 


AB014528 


453913 


AW004683 


435495 


AI754212 


423629 


AW021173 


411836 


AW901879 


415030 


031118 


419606 


AW294795 


440310 


AA878939 


443608 


A1375957 


414140 


AA281279 


444781 


NM 014400 


435393 


AA701259 


454071 


AI041793 


446922 


BE175605 


448062 


AW295923 


427711 


M31659 


450382 


AA397658 


424866 


W01938 


433043 


W57554 


420802 


U22376 


445625 


BE246743 


403677 


NA 


411093 


BE067650 


435255 


W87434 


443127 


BE568102 


448104 


A1674818 


427315 


AA179949 


430414 


AW365665 


423600 


AI633559 


458562 


N34128 


402109 


NA 


429629 


BE501732 



Hs.308 

Hs.293471 

Hs.94694 

Hs.51965 

Hs.61460 

Hs.105794 

Hs.44143 

Hs.40747 

Hs.4774 

Hs.95351 



Hs,231945 

Hs.6099 

Hs.27252 



Hs.293334 

Hs.129115 

Hs.42932 

Hs.13034 

Hs.83213 

Hs.12504 

Hs.162859 

Hs.72157 

Hs.272798 

Hs.39311 

Hs.177537 

Hs.270535 

Hs.1 18569 

Hs.279912 

Hs.290263 

Hs.43133 

Hs.78934 

Hs.21951 

Hs.18612 

Hs.314453 

Hs.191735 

Hs.198529 

Hs.125406 

Hs.289074 

Hs.23317 

Hs.11950 

Hs.189299 

Hs.42502 

Hs.255472 

Hs. 180408 

Hs.60257 

Hs.337243 

Hs.125019 

Hs.1334 

Hs.288529 



arresiin 3, retinal (X-arrestin) 3.1 

ESTs 3.1 

Homo sapiens cDNA RJ10561 lis, clone NT 3.0 

K1AA1209 protein 3.0 

)g superfamily receptor INtR 3.0 

UDP^lucc^eiglycorjroteinglucosyltransfe 3.0 

polybromo 1 3.0 

ESTs 3.0 
Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.0 

lipase, hormone-sensitive 3.0 

ENSP00000241065*:CDNA 3.0 

NMJX)2944*:Homo sapiens v-ros avian UR2 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

Target Exon 3.0 
gb.-601297871F1 NIH_MGC_19 Homo sapiens c 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

fatty acid binding protein 4, adipocyte 3.0 

C10000447*:gi|1 168375|sp|P43467|AGA1J 3 ED 3.0 

likely orthoiog of mouse Arkadia 3.0 

ESTs 3.0 

DKFZP564I1922 protein 3.0 

C11001883*gi}6753278|refjNPJ)33938.1| c 3.0 

hypothetical protein FU20413 3.0 

ESTs 3.0 

hypothetical protein DKFZp761B1514 3.0 

ESTs 3.0 

Dvl-binding protein IDAX exhibition of 3.0 

KIAA0419 gene product 3.0 

ESTs, Weakly similar to I38022 hypotheti 3.0 

gb:yg99d09.r1 Soares infant brain 1NIB H 3.0 

K1AA0628 gene product 3.0 

mutS (E. coB) homolog 2 {colon cancer, 3.0 

Homo sapiens Xq pseudoautosomal region; 3.0 

Homo sapiens cDNA: FU21909 fis, done H 3.0 

ESTs 3.0 

hypothetical protein MGC10520 3.0 

ESTs 3.0 

ESTs 3.0 

F-box only protein 22 3.0 

hypothetical protein FU14681 3.0 

GPI-anchoredmetastasis^assodatedprote 3.0 

ESTs 3.0 

ESTs 3.0 
gb:RC5-HT0580-100500-022-H07 HT0580 Homo3.0 

KIAA1843 protein 3.0 

solute carrier family 25 (mitochondrial 3.0 

Homo sapiens cDNA RJ13598 fts, clone PL 3.0 

ESTs, Weakly simflar to ALU7_HUMAN ALUS 2.9 

lymphoid nuclear protein (LAF-4) mRNA 2.9 

v-myb avian myeloblastosis viral oncogen 2.9 

hypothetical protein RJ22635 2.9 

C4001462:gi|4887715|gb|AAA79329.2| (L088 2.9 
gb:MR4-BT0358^)90300-003-e01 BT0358 Homo 2.9 

Hs.106015 ESTs, Moderately similar to ALU1.HUMAN A 2.9 

Hs.180312 mitochondrial ribosomal protein S16 2.9 

Hs.316433 HomosapienscDNAFU11375fis,cloneHE 2.9 
Hs.175563 Homo sapiens mRNA; cONA DKFZp564N0763 (f 2.9 

Hs.120388 ESTs 2.9 

Hs.310359 ESTs 2.9 

Hs.145268 ESTs 2.9 

Target Exon 2.9 

Hs.30622 Homo sapiens cDNA FU13010 fis, clone NT 2.9 
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442295 A1827248 
419752 AA249573 
404721 

445107 AI208121 
401987 

430566 AA481282 
444517 AI939339 
445563 AW873606 
427691 AW194426 
456561 AI868634 
401458 

421039 NM 003478 
459504 BE514127 
424962 NMJ)12288 
409617 BE003760 
416931 045371 
413221 BE161151 
409732 NNL016122 
433687 AA743991 
434340 AI193043 
454529 Z45439 
421379 Y15221 
452291 AF015592 
457402 AW452648 
449051 AW961400 
408761 AA057264 
401093 

435061 AI651474 
447985 AI681475 
449340 AW235786 
426384 AI472078 
411905 BE265067 
405953 NA 
420854 AW296927 
434265 AA846811 
428365 AA295331 
442861 AA243837 
448337 AW206453 
452554 AW452434 
412248 BE176480 
449450 AL039852 
434757 AI038997 
409038 T97490 
454545 AW806899 
439842 AI910896 
428479 Y00272 
424800 AL035588 
411086 BE070800 
400250 NA 
449168 NM_016206 
456482 AA485224 
426044 AA502490 
431854 AA383550 
405873 NA 
440400 AA994364 
458265 AI075375 
413708 BE158791 
423739 AA398155 
424408 AJ754813 
453096 AW294631 
421825 AA298758 
417742 R64719 
402765 

444378 R41339 
419172 AW338625 
401497 
402376 



Hs.224398 Homo sapiens cDNA RJ1 1469 lis, done HE 2.9 
Hs.152618 ESTs. Moderately similar to ZN91_HUMAN Z 2.9 
NM_005596*:Homo sapiens nuclear factor I 2.9 
Hs.147313 ESTs, Weakly similar to 138022 hypotheti 2.9 
N^002737*:Homo sapiens protein kinase C 2.9 
Hs.190149 ESTs 2.9 
Hs.146883 ESTs 2.9 
Hs.149006 ESTs 2.9 
Hs.20726 ESTs 2.9 
Hs.246358 ESTs, Weakly similar to T32250 hypotheti 2.9 
Target Exon 2.9 
Hs.101299 cullinS 2.9 
gb:601315974F1 NIH _MGC_8 Homo sapiens cD 2.9 
Hs.153954 TRAM-like protein 2.9 
Hs.55209 Homo sapiens mRNA; cONA DKFZp434K0514 (f 2.9 
Hs.80485 adipose most abundant gene transcript 1 2.9 
gb:PM0-HT0425-141299-001-F08 HT0425 Homo2.9 
Hs.56148 NY-REN-58 antigen 2.9 
gb:ny57g01.s1 NCLCGAPJW Homo sapiens 2.9 
Hs.128685 ESTs, Weakly similar to T1 7226 hypotheti 2.9 
Hs.270425 ESTs 2.9 
Hs.103982 small inducible cytokine subfamily B (Cy 2.9 
Hs.28853 CDC7 (cell division cycle 7, S.cerevisl 2.9 
Hs.149342 activation-induced cytidine deaminase 2.9 
Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.9 
Hs.238936 ESTs, Weakly similar to (defBne not ava 2.9 
C12000586*:gil6330167|db]|BAA86477.1 1 (A 2.9 
Hs.163944 ESTs 2.9 
Hs.200949 ESTs 2.9 
Hs.195359 hypothetical protein MGC10954 2.9 
Hs.303662 ESTs 2.9 
gb:601 193893F1 NIH_MGC_7 Homo sapiens cD 2.9 
Target Exon 2.8 
gb:UW^W0*jc-c47-0-UI,s1 NCLCGAP.Su 2.8 
Hs.130554 Homo sapiens cONA: FU23089 fis. done L 2.8 
Hs.183861 Homo sapiens cDNA FLJ20042 fis, clone CO 2.8 
Hs.57787 ESTs 2.8 
Hs.3782 ESTs 2.8 
Hs.58006 ESTs, Weakly similar to ALU5J-IUMAN ALU S 2.8 
gb:RC3-HrQ585-160300-022-c02 HT0585 Homo 2.8 
Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 2.8 
Hs.132921 ESTs 2.8 
Hs.50002 smai) inducible cytokine subfamily A (Cy 2.8 
gb:QV4-ST0023-160400-172-c12 ST0023 Homo 2.8 
Hs.132413 ESTs 2.8 
Hs. 184572 ceil division cyde 2, G1 to S and G2 to 2.8 
Hs,153203 MyoO family inhibitor 2.8 
gb:RC3-BT0502-251 199-011-c07 BT0502 Homo 2.8 
Eos Control 2.8 
colon carcinoma related protein 2.8 
gb:aa41b12.s1 NCLCGAP_GCB1 Homo sapiens2.8 



Hs.23142 

Hs.336695 
Hs.271699 

Hs.125594 
Hs.128193 

Hs.97600 
Hs.146428 
Hs.11325 
Hs.183747 



Hs.12569 
Hs.22120 



ESTs 2.8 

polymerase (DNA directed) iota 2.8 

Target Exon 2.8 

ESTs, Weakly similar to T25472 hypotheti 2.8 

ESTs, Weakly similar to IRX2JWMAN IROQU 2.8 

gb:lL2-HT0397-091299-0254302HT0397Homo 2.8 

ESTs 2.6 

collagen, type V, alpha 1 2.8 

ESTs 2.8 

ESTs, Moderately similar to CAIBJHUMAN C 2.8 
gb:EST22d11 WATM1 Homo sapiens cONA clon 2.8 

C1003621*:gi|12407405{gb}AAG53491.1|AF22 2.8 

ESTs 2.8 

ESTs 2.8 

Target Exon 2.8 

C19000763*:gij1363912|pirj|JC4296 ring f 2.8 
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405041 NA C300170B 4 :gll1345652|sppi5989lCA36_CHlC 18 

408758 NMJJ03686 Hs.47504 exonucleasel 2.8 

431917 D16181 Hs.2868 peripheral myelin protein 2 2.8 

437583 AA761190 Hs.244627 ESTs 2.8 

453737 AA744862 Hs.194293 ESTs, WeaJdy similar to I54374 gene NF2 2.8 

458094 AF086325 gb:Homo sapiens full length insert cDNA 2.8 

401283 NA Target Exon 2.8 

410784 AW803201 gb:IL2-UM0077-070500-080-E06 UM0077 Homo 2.8 

417601 NM_014735 Hs.82292 WAA0215 gene product 2.8 

418236 AW994005 Hs.337534 ESTs 2.8 

435532 AW291488 Hs. 117305 Homo sapiens, clone IMAGE:3682908, mRNA 18 
454714 AW815098 gb:GV4^T02l2-091199-023-f10 ST0212Homo 2.8 

418629 BE247550 Hs.86859 growth factor receptor-bound protein 7 2.8 

442101 AI651930 Hs.135684 ESTs 2.8 

405080 AK000375 Hs.m20 HDCMC28P protein 2.8 

414661 T97401 Hs.21929 ESTs 2.8 

425589 A1650633 Hs.143688 Homo sapiens cDNA: FU23031 fis, done L 2.8 

429638 AI916662 Hs.211577 Wnectin 1 (kinesin receptor) 2.7 

428824 W23624 Hs.173059 ESTs 2.7 

421566 NM000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 2.7 
414596 BE386870 gb:601 27527 1F1 NIH MGC 20 Homo sapiens c 2.7 

440868 R79707 HsJ263339 ESTs, Moderately similar to 138022 hypot 2.7 

452943 BE247449 Hs.31082 hypothetical protein FU 10525 17 

443772 AV646449 Hs.282872 ESTs 2.7 

432361 A1378562 Hs.159585 ESTs 2.7 

430375 AW371048 Hs.93758 H4 histone family, member H 2.7 

406504 NA C5000558:gil4504675|reqNP_002175.1|int 2.7 

423279 AW959861 Hs.290943 ESTs 2.7 

424871 NMJD04525 Hs.153595 low density Hpoproteiruelated protein 2.7 

453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 2.7 

423961 013666 Hs.136348 osteoblast speafc factor 2 (fascfcCn 2.7 

422156 N34524 gb:yy56d10.s1 Soaresjnultiplejiderosis. 2.7 

415752 BE3 14524 Hs.78776 putative transmembrane protein 2.7 

419987 NMJJ05014 Hs.94070 osteomoduiin 2.7 

406182 NA Target Exon 2.7 

416495 X69970 Hs.79350 RYK receptor-like tyrosine kinase 2.7 

444701 AI916512 Hs.198394 ESTs • 2.7 

408171 AA301228 Hs.43299 hypothetical protein FU12890 2.7 

430153 AW968128 Hs.336679 ESTs 2.7 

413383 AA128978 Hs.154706 hypotfiefical protein FLJ1491 7 2.7 

414831 M31158 Hs.77439 protein kinase, cAMP-dependent regulato 2.7 

413278 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 2.7 

433132 AB026264 Hs.284245 hypothetical protein IMPACT 2.7 

437030 AA742577 Hs.303781 EST 2.7 

439031 AF075079 gb:Homo sapiens full length insert cDNA 2.7 

449532 W74653 Hs.271593 ESTs, Moderately similar to A47582 B-cel 2.7 

406153 Target Exon 2.7 

406625 Y13647 Hs. 11 9597 stearoy^CoA desaturase (delta-9-desatur 2.7 

444698 AI188139 Hs.147050 ESTs 2.7 

432328 AI572739 Hs.195471 6-riwsphofmcto-2-ldnase/mjctose-2,6^1 17 

429628 H09604 Hs.13268 ESTs 2.7 

420149 AA255920 Hs.88095 ESTs 2.7 

431207 AA495925 Hs.9394 ESTs 2.7 

438394 BE379623 Hs.27693 peptidylprolyl isomerase (cydophifinH 17 

443304 A1050073 Hs.135338 ESTs Z7 

427660 AI741320 Hs.114121 Homo sapiens cDNA: FU23228 hs, done C 2.7 

408460 AA054726 Hs.2B5574 ESTs 2.7 

416515 N91716 Hs.194140 ESTs, Weakly similar to 138022 hypoiheti 2.7 

429922 297630 Hs.2261 1 7 H1 histone family, member 0 2.7 

418203 X54942 Hs.83758 CDC28 protein kinase 2 2.7 

439509 AF086332 Hs.58314 ESTs 2.7 
402184 NA ENSP00000245238*;CDNAFU10922fis.don 2.7 

450496 AW449251 Hs.257131 ESTs 2.7 

451963 AI825440 Hs.224952 ESTs 2.7 

457938 A1373638 Hs.133900 ESTs 2.7 

441541 AA938663 Hs.199828 ESTs 2.7 

441111 AI806867 Hs.126594 ESTs 17 
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423020 AA383092 Hs.1608 replication protein A3 (14kD) 2.7 

445354 AV653485 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 2.7 

427961 AW293165 Hs.143134 ESTs 27 

410889 X91662 Hs.66744 twist (Orosophfla) homolog (acrocephatos 27 

445234 AW137636 Hs.146059 ESTs 2.7 

413903 AA496493 Hs.23136 ESTs 2.7 

406069 NA Target Exon 2.7 

447410 AI470235 Hs.172698 EST 2.7 

401256 NA NJ^.024089*^omo sapiens hypothetical pro 2.7 

415139 AW975942 Hs.48524 ESTs 2.7 

420218 AW958037 Hs.286 ribosomaJ protein L4 2.7 

455511 BE144762 gb:CM0-HT01 80-04 1099465-b04 HT01 80 Homo2.7 

438825 BE327427 Hs.79953 ESTs 2.$ 

409564 AA045857 Hs.54943 fracture callus 1 (rat) homolog 2.6 

452837 AL121053 Hs.5534 Homo sapiens cDNA FU 12961 lis, done NT 2.6 

434876 AF160477 Hs.61460 Ig superfamily receptor LNIR 2.6 

421565 AK001122 Hs.105859 hypothetical protein FU 10260 2.6 

453279 AW893940 Hs.59698 ESTs 2.6 

430785 Z30201 gb:HHEA22G Atrium cDNA library Human hea 2.6 

456986 038299 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 2.6 

433068 NMJ06456 Hs.268215 sialyltransferase 2.6 

421952 AA300900 Hs.98849 ESTs, Moderately similar to AF1 6 1511 1 H 2.6 

429208 AA447990 Hs.190478 ESTs 2.6 

430733 AW975920 Hs.283361 ESTs 2.6 

441720 AI346487 Hs.28739 ESTs 2.6 

418986 A1123555 Hs.81796 ESTs 2.6 

432481 AW451645 Hs.151504 Homo sapiens cDNA FLJ1 1973 fis, clone HE 2.6 

434338 AW75431 1 gb:CM1-CT0337-141299468-fQ7 CT0337 Homo 2,6 

417061 AI675944 Hs.188691 Homo sapiens cDNA FU 12033 fis, done HE 2.6 

41D530 M25809 Hs.64173 ATPase, H transporting, lysosomal (vacuo 2.6 

456672 AK002016 Hs.1 14727 Homo sapiens, done MGC:16327, mRNA, com 2.6 

425071 NMJ13989 Hs.154424 daiodinase, lodolhyronine, type ll 2.6 

408868 AW292286 Hs.255058 ESTs 2.6 

451531 AA018311 Hs.114762 ESTs 2.6 

405822 Target Exon 2.6 

418301 AW976201 Hs.53913 hypothetical protein FU 10252 2.6 

417315 A1080042 Hs.336901 ribosomal protein S24 2.6 

434699 AA643687 Hs.149425 Homo sapiens cDNA FU 11 980 fls, clone HE 2.6 

443204 AW205878 Hs.29643 Homo sapiens cDNA FU13103 fis, done NT 2.6 

405638 Target Exon 2.6 

452542 AW812256 gb:RC0-ST01 74-191099431*07 ST01 74 Homo 2.6 

403943 C5000355:gij4503225IrefjNP_000765.1J cyt 2.6 

404535 Z25884 Hs.121483 chloride channel 1 , skeletal musde (Tn 2.6 

402800 NA Target Exon 2.6 

449144 AI989503 Hs.233405 ESTs 2.6 

454934 AW846080 Hs.314324 ESTs 2.6 

424717 H03754 Hs.152213 wingless-type MMTV integration site fami 2.6 

428303 AW974476 Hs.183601 regulator of G-protein signalling 16 2.6 

427970 AA418187 Hs.330515 ESTs 2.6 

450638 AK001826 Hs.25245 hypothetical protein FU 1 1269 2.6 

453034 BE246010 Hs.271468 Homo sapiens mRNA for FU00038 protein, 2.6 

455097 AW855802 gb:RC1-CT0279-1702Q0-Q23-d08 CT0279 Homo 2.6 

427317 AB028955 Hs.175780 KIAA1 032 protein 2.6 

408875 NM_015434 Hs.48604 DKFZP434B 168 protein 2.6 

427510 Z47542 Hs.179312 small nuclear RNA activating complex, po 2.6 

423201 NM_000163 Hs.1 251 80 growth hormone receptor 2.6 

408271 Target Exon 2.6 

442696 BE566962 Hs.7063 Homo sapiens cDNA: FU209 13 fls, done A 2.6 

454018 AW016892 Hs.100855 ESTs 2.6 

435420 AI928513 Hs.59203 ESTs 2.6 

434398 AA121098 Hs.3838 serum-indudble kinase 2.6 

455708 BE069326 gb:QV3-BT0381-170100-060-g03 BT0381 Homo 2.6 

439347 W24320 Hs.102941 Homo sapiens cDNA:FU2 1531 fis. done C 2.6 

407523 X64984 gbRsaplens rnRM HTPCRX10 for olfactory 2.6 

425101 AA830431 Hs.180811 ESTs 2.6 

435153 AA668763 Hs.291939 ESTs 2.6 

409139 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 2.6 
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455100 
414612 
440283 
423025 
431473 
404440 
403388 
403775 
405037 
407447 
420952 
435447 
440202 
445854 
421247 
414870 
457411 
424676 
404443 
452268 
430832 
444779 
408633 
459089 
447197 
454111 
411-165 
406922 
456045 
413111 
423123 
432201 
427032 
445417 
422225 
428330 
410011 
426310 
419589 
437770 
442818 
414251 
428301 
431956 
455732 
458624 
428257 
418054 
458652 
458012 
422996 
410804 
457211 
440029 
448141 
409163 
431385 
414569 
439963 
455935 
425025 
416589 
404826 
422938 
421991 
417404 
448516 



BE160198 
BE274552 
AI732892 
AA831267 
AA825686 

NA 
NA 
NA 

AF290544 

AA282Q67 

A1872932 

AW516211 

AJ702335 

BE391727 

N72264 

AW085961 

Y08565 

NM.003512 

AI073913 

AI192105 

AW963372 

F13036 

R36075 

AW081681 

NM.000169 

S70284 

H62943 

BE065837 

NM 012247 

A1538613 

AF012023 

AK001058 

BE245652 

L22524 

AB020641 

NM_OD0909 

AW973708 

AA767881 

AK001741 

AL042306 

AW628666 

AK002032 

BE080908 

AI362790 

BE394723 

NMJW2318 

AW375610 

AI424899 

BE091089 

U64820 

AW972565 

AW089705 

AI471598 

AA065081 

BE178536 

AF109298 

AW247529 

BE 158687 

AW953168 

AA652687 

NNL001809 
NNL014918 
NM.007350 
AW898595 



gb:QV1-HT0413^)1020(K)594i03HT0413 Homo 2.6 
Hs.76578 protein Inhibitor of activated STAT3 2.6 
Hs.190489 ESTs 2.6 
Hs. 12244 hypothetical protein RJ20097 2.6 
Hs.321176 ESTs, WeaWy similar to S65824 reverse t 2.6 
NM_021048:Homo sapiens melanoma antigen, 2.6 
C3001398*:gil12248917|dbj|BAB20375.1| (A 2.6 
Target Exon 2.6 
NM_021628*:Homo sapiens arachidonate fip 2.6 
gb:Homo sapiens amlnopeptidase mRNA, par 2.6 
Hs.88972 ESTs, Moderately similar to A46010 X-fin 2.6 
gb;wm72e03.x1 NCLCGAP_Ut2 Homo sapiens 2.6 



Hs.125300 
Hs. 145568 
Hs.102910 
Hs.300670 
Hs.130093 
Hs.151678 

Hs.28777 
Hs.100686 
Hs.147170 
Hs.46677 
Hs.27373 

Hs.269064 
Hs.69089 

Hs.154188 

Hs.124027 
Hs.298241 
Hs.173274 
Hs.12680 
Hs.1 18281 
Hs.2256 
Hs.57856 
Hs.169266 
Hs.201925 
Hs.122897 
Hs.8739 
Hs.97689 
Hs.98440 
Hs.272245 

Hs.278639 
Hs.275243 
Hs.83354 
Hs.117102 
Hs.188211 

Hs.66521 
Hs.32399 
Hs.293711 
Hs.197531 

Hs.11090 
Hs.1 18258 
Hs.6793 

Hs.12407 
Hs.96151 

Hs.1594 
Hs.1 10488 
Hs.82101 



ring finger protein 21 , Interferon-fespo 2.6 
ESTs 2.6 
general transcription factor I1H, polype 2.6 
K1AA1204 protein 2.6 
ESTs 2.6 
UDP^-acetytelpra-D^alactosamine:polyp 2.6 
C8001428*:gq6572242|embjCAB62951.1| (Z9 2.6 
H2A histone family, member t 2.6 
ESTs, Weakly similar to JE0350 Anterior 2.6 
ESTs 2.6 
PRO2000 protein 2.6 
Homo sapiens mRNA; cONA DKFZp56401763 (f 2.6 
gb:yh88bQ1.s1 Scares placenta Nb2HP Homo 2.6 
ESTs, Weakly similar to T42689 hypotheti 2.6 
galactosidase, alpha 2.6 
gbstearoyl-CoA desaturase [human, adipo 2.6 
ESTs 2.6 
gb:RC2-8T0318-110100-012-g12 BT0318 Homo 2.6 
SELENOPHOSPHATE SYNTHETASE ; Human selenZ6 
Transmembrane protease, serine 3 2.5 
integrin cytoplasmic domain-associated p 2.5 
Homo sapiens cONA FU10196 fis, clone HE 2.5 
zinc finger protein 266 2.5 
matrix metalloproteinase 7 {MMP7; uterin 2.5 
PFTAIRE protein kinase 1 2.5 
neuropeptide Y receptor Y1 2.5 
Homo sapiens cDNA FU 13446 Gs, clone PL 2.5 
ESTs 2.5 
hypothetical protein FU10879 2J5 
VASA protein 2.5 
ESTs, Weakly similar to I38022 hypotheti 2.5 
Homo sapiens cONA FU1 1 170 fis, done PL 2.5 
gb:QV1-BT0631-28020(W844i07BT0631 Homo 2.5 
KIAA1684 protein; likely homolog of mous 2.5 
S100 calcium-binding protein A6 (calcycl 2.5 
lysyl oxidase-Iike 2 2.5 
hypothetical protein FU13046 similar to 2.5 
ESTs 2.5 
gb:PIW-BT0724-13040Q-006-c07 BT0724 Homo 2.5 

2.5 
2.5 
2.5 
2.5 
2,5 
2.5 
2.5 
2.5 



ESTs, Weakly similar to S51797 vasodilat 
ESTs, Weakly similar to S64329 probable 
ESTs 

gb:zm13a03.s1 Stratagene pancreas (93720 
membrane-spanning 4-domalns, subfamily A 
prostate cancer associated protein 1 
platelet-activating factor acetyihydrola 
gb£M0-HT0395-280100-169-b09 HT0395 Homo2.5 

ESTs 2.5 

Human DMA sequence from done RP5-1103G7 2.5 

Target Exon 2.5 

centromere protein A (17kD) 2.5 

K1AAG990 protein 2.5 

pfeckstrin hornotogy-tOce domain, family Z5 
gb^C1-NN0073-26040(W)11-g09 NN0073 Homo2.5 
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403356 NA ENSPO0000251 525*:Hypothetlcal protein Kl Z5 

404983 ENSP00000252242*:Keratin, type II cytosk 2.5 

418282 AA215535 Hs.98133 ESTs 2.5 

427409 AW467143 Hs.135411 actin related protein 2.5 

431806 AF186114 Hs.270737 tumor necrosis factor (llg and) superfami 2.5 

443367 AW071349 Hs.215937 ESTs 2.5 

421246 AW582962 Hs.102897 CGl-47 protein 2.5 

439217 AF086041 Hs.42975 ESTs 2.5 

400925 Target Exon 2.5 
404552 NA ENSP0Q00022Q888*:ZINC FINGER TRANSCR1PT12.5 

417168 AL1331 17 Hs.81376 Homo sapiens mRNA; cONA OKFZp586L1 121 (f 2.5 

418841 NMJW2332 Hs.89137 tow density llpoprotein-related protein 2.5 

426853 U32974 Hs.172777 baculoviraUAP repeat-containing 4 2.5 

427738 NrA.000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 2,5 

457384 AA501760 Hs.15806 HomosapiensmRNA;cDNADKFZp434H2019(t2.5 

447128 AI271898 Hs.164866 cydinK 2.5 
454693 AW813428 gb:MR3^T0192410200-210^5ST0192 Homo 2.5 

434657 AA641876 Hs.191840 ESTs 2.5 

402077 NA Target Exon 2.5 

400289 X07820 Hs.2258 matrix metalioproteinase 10 (MMP10; str 2.5 

. 409723 AW885757 Hs.257862 ESTs 2.5 

447020 T27308 Hs.16986 hypothetical protein FU1 1046 2.5 

455068 AI807894 Hs.47274 Homo sapiens mRNA; cDNA OKFZp564B176 (fir 2.5 

431232 A1024353 Hs.131755 hypothetical protein FU14298 2.5 

408938 AA059013 Hs.22607 ESTs 2.5 

411571 AA122393 Hs,70811 hypothetical protein FU2051 6 2.5 

426504 AW162919 Hs. 170160 RAB2, member RAS oncogene famWyAike 2.5 

428248 AI126772 Hs.40479 ESTs 2.5 

408813 AI580090 Hs.48295 RNA helicase family 2.5 

423504 N80O77 Hs.24792 chromosome 12 open reading frame 5 2.5 

425441 AA449644 Hs. 193063 Homo sapiens cONAFU 14201 fis, clone NT 2.5 

443066 AW297921 Hs.255703 ESTs 2.5 

443556 AA256769 Hs.94949 methylmalonyl-CoA epimerase 2.5 

428943 AW086180 Hs.37636 ESTs, Weakly similar to KIAA1392 protein 2.5 

425320 U29344 Hs.83190 fatty add synthase 2.5 

430388 AA356923 Hs.240770 nuclear cap binding protein subunit 2, 2 2.5 

423242 AL039402 Hs.125783 DEME-6 protein 2.5 

416241 N52639 Hs.32683 ESTs 2.5 

440244 A1743977 Hs.205144 ESTs 2.5 

409239 AA740875 Hs.44307 ESTs. Moderately similar to 138022 hypot 2.5 

452464 AW500507 Hs.192619 K1AA1600 protein 2.5 

410718 AI920783 Hs.191435 ESTs 2.5 

408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 2.5 

445150 AI446747 Hs.338704 olfactory receptor, family 7, subfamily 2.5 

407756 AA1 16021 Hs.38260 ubiquitln specific protease 18 2.5 

407633 NWJ507069 Hs.37189 similar to rat HREV107 2.5 

449754 H00820 Hs.30977 ESTs, Weakly similar to B34087 hypotheti 2.5 

419316 AA236255 Hs.298419 ESTs 25 

429118 H20669 Hs.35406 ESTs, Highly similar to unnamed protein 2.5 

440331 AL046412 Hs.202151 ESTs 2.5 

449344 A1640355 Hs.312691 ESTs 2.5 

459006 AW298631 Hs.27721 Wolf-Hirschhom syndrome candidate Mik 2.5 

423165 A1937547 Hs.124915 hypothetical protein MGC2601 2.5 
411337 AW837349 gb:QV2-LT0038-270300-10fl-d12 LT0O38 Homo 2.5 

438290 AA843719 Hs.122341 ESTs 2.5 

406414 C5000506 # :gi|124941|spIP18614|ITA1_RAT I 2.5 

424498 AB033043 Hs.149377 hypofoetical protein DKFZp761L0424 2.5 

443464 BE548446 Hs.5167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr 2.5 

424856 AA347746 Hs.9521 ESTs, Weakly similar to ZN43.HUMAN ZINC 2.5 

440304 BE159984 Hs.125395 ESTs 2.5 

409045 AA635062 Hs.50094 Homo sapiens mRNA; cDNA DKFZp434O0515 (f 2.5 

422648 D86983 Hs.118893 Melanoma associated gene 2.5 

428819 AL135623 Hs.193914 KIAA0575 gene product 2.5 

412520. AA442324 Hs.795 H2Ahistone family, member O 2.5 

430602 D13752 Hs.184927 cytochrome P450, subfamily XIB (steroid 2.5 

408031 AA081395 Hs.42173 Homo sapiens cDNA FU10366 fis, clone NT 2.5 

403133 Target Exon 2,5 
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413189 BE070231 gb:GV4-BT0407-2601 00-087-11 2 BT0407 Homo 2.5 

400346 AB041259 Hs.272263 Homo sapiens mRNA for keratin 19, pariia 2.5 

435509 AW58679 Hs.181915 ESTs 2.5 

458145 AI239457 Hs.130794 ESTs 2.5 
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TABLE 19A 



10 



Table 19A shows the accession numbers for those pkeys lacking unigenelD's for Table 19. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



40 



45 



50 



55 



60 



Pkey 

407647 
407980 
408254 



409695 
410534 
410672 
410784 
410785 
410835 
411050 
411086 
411093 
411111 
411171 
411337 
411514 
411670 
411905 
412102 
412209 
412248 
413043 
413111 
413169 
413221 
413499 
413708 
414210 
414596 



1007366 J 

103087J 

1049346.1 



15 



20 



25 



30 



3 5 409163 110418J 



CAT number Accessions 



114876.1 

1207247 J 

1214882.1 

1221005.1 

1221055J 

1223785.1 

1230330.1 

1231500.1 

1231970J 

1232669 1 

1234393.1 

1239217.1 

1248638J 

1253680J 

1265181.1 

1277395.1 

1283610 1 

1285000.1 

1346556.1 

1349546.1 

1352723 1 

1353887 1 

1373910J 

13B4140J 

1426051J 

1465004.1 



AW860158AW862385AW860159AW862386AW862341AW821869AW821893AW062660AW062656 
AA046309 AI263500 AA046397 

AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AW807083 AW845786 AW845801 
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW1 77424 
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390 
AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AW807098 
AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AWB07166 AW807501 
AW807120AW807168 AW807121 AW807527AW807151 AW807203 AW807489AW807511 AW807158 AW845800 
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141 
AW807522 AW807487 AW807514 AW807142 AW807232 AW807379 AW8071 14 AW807518 AW807199 AW80721 1 
AW807498 AW807086 AW807492 AW807218 AW807082 AW807525 AWB07493 AW807523 AW807087 AW845784 
AW807037 AW807128 AW807080 AW807118 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160 
AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103 
AW845870 AW177099 AW177101 AW807528 AW807336 AW807038 AW177100 AW80741 1 AW807088 AW845865 
AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521 
AW807488 AW807385 AW807355 AW807223 AW807155 

AA065081 AA075017 AA084791 AA071015 AA081560 AA071459 AA545727 AA083100 AA085366 AA115845 AA075457 

AA064704 AA082878 AA075742 AA069162 

AA296961 AA296889 AA076945 AA077528 AA077497 

AW905138 AW753008 R13818 Z43519 

AW794600AW794730 

AW803201 BE079700 BE062940 

AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

AW806906 AW80691 5 AW856460 AW866475 AW666462 AW866448 AW866372 AW866604 

AW814902 8E156656 BE156667 BE156590 BE156441 BE156447 

BE070800 AW875226 BE1491 15 

BE067650AW817053 

AW818127AW818161 R09719 

AW820260 AW820332 R94406 

AW837349 AW837355 AW882717 

AW850178 AW850233 AW850445 AW850446 

AW856552 AW861 101 AW856574 AW861099 AW861 100 AW856573 AW856576 AW856562 

BE265067 BE264978 AW875420 

H56435H56572AW892929 

AW901456 AW901450 AW901441 

BE176480 AW903298 AW903313 

BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678 
6E065837 BE065805 BE065799 BE065818 BE065839 8E065831 BE065894 BE065789 BE065792 
BE070231 BE070229 BE070255 

BE161151 BE162495 BE161002 BE072205 BE1 60989 BE 162482 
BE144884 H97942 

BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 BE158685 

BE383592BE261671 

BE386870 Z41986H08501 
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414605 


1465790.-1 


BE390440 


415747 


155189 1 


AA381209 AA381245 AA167683 


416173 


1574973J 


R52782 R17313 H24192 R19B76 


417742 


1696282J 


R64719 Z44680R12451 


417974 


171237J 


AA210765 T95700H94407 


418636 


177402J 


AW749855 AA225995 AW750208 AW750206 


419536 


185688 J 


AA603305 AA244095 AA244183 


420854 


197072J 


AW296927 AI684514 A1263168 AA281079 


422156 


212379J 


N34524 AA305071 AW954803AA502335 AI433430 AI203597 AW026670 AW265323 AW850787 AA317554 AW993643 






AW835572 AW385512 A1334966 W32951 H62656 H53902 R88904 AW835732 


422996 


223666 1 


BE091089 BE091123 AA319959 


423833 


232451.1 


AW503329 N46610AA331571 


423841 


232507.1 


AW753967 AA370795 AA331630 AW962550 


423945 


233566.1 


AA4 10943 AW948953 AA334202 AA332882 


425201 


247933.1 


AA3521 1 1 AW962247 AA429695 


426650 


270283.1 


AA382814AA402411AA412355 


426878 


273265.1 


BE069341 AW748403 AL044891 AI908240 AA393080 


430264 


315008.1 


AA470519 8E303010 BE302954 BE384120 


430785 


323486.1 


Z30201 AA486132T72025 


431676 


336411 1 


A1685464 AW971336 AA513587 AA525142 


433687 


373061.1 


AA743991 AA604852 AW272737 


434338 


383982.1 


AW754311 AA630185 AW803285 


434469 


387447.1 


AA634806 C18732 AA729161 AA729860 


435447 


406400.1 


AI872932 AA682306 BE220163 W88695 T81307 H91447 


437152 


43386 1 


AL050027BE089051 


437854 


44418.1 


AL119723 AL119874 AI90901B U50537 


439031 


46798.1 


AF075079H48601 H48795 


439255 


470321.1 


BE164500 AA832198 BE164502 


444910 


624951.1 


A1201849 BE069007 AW946544 


445432 


63943.1 


AV653771 BE089370 


446922 


69865.1 


BE175605 Z43529 F06610 BE175602 AV661027 


447197 


711623.1 


R36075AI366546 R36167 


448420 


76273.1 


BE623004 AA380669 BE263627 BE246433 


448516 


766241 1 


AW898595 AW898588 AW898590 AW898663 AW898592 AI525093 


450522 


837264.1 


AI698839 AI909260 AI909259 


450736 


844652J 


AW970060 AI732366 AI792313 AW839644 


451024 


85565 1 


AA442176AA259181 


451067 


85759 1 


8E172186 AA059279 AA020815 AA013437 


451340 


86640 1 


AW936273 AW340350 AA017208 


452542 


921410.1 


AW812256 AW812257 AI906423 AI906422 


452564 


92227.1 


M026777 N50065 R09961 N54721 


453472 


968371 1 


AL037925 AL037931 AL037957 


454307 


1106070.1 


AW855717 AW362452 AW362443 


454359 


1130674 1 


N71277AW390764 


454545 


1223779.1 


AW806899 AW866451 AW866393 AW866297 AW817869 


454693 


1229132J 


AW813428 AW813444 AW813367 AW813368 AW813429 AW813424 


454714 


1230493 1 


AW815098 BE154843 BE 154831 


455047 


1250536.1 


AW852530 AW852527 AW852526 


455092 


1252971.1 


BE152428 AW855572 AW855607 


455097 


1253130.1 


AW855802 AW855794 AW855797 AW855806 AW855796 AW855808 AW855793 AW855807 


455100 


1253334 1 


BE160198 AW935898 T1 1520 AW935930 AW856073 AW861034 


455431 


1289854.1 


AW938484 BE001245 BE001190 


455511 


1321229.1 


BE144762AW979091 


455609 


1337548.1 


BE011668 BE011689 BE011627 BE011679 BE011699 BE011678 BE011696 BE011675 BE011622 BE011635 


455651 


1348732.1 


BE064962 BE064979 BE064853 BE064857 BEQ64856 BE064977 BE064960 BE064860 BE064815 BE064957 BE064804 






BE064816 BE064850 BE064806 BE064796 BE064818 BE064975 BE064819 BE064810 BE064668 BE065059 


455685 


1350393J 


BE066976 BE066928 BE066927 


455700 


1351264.1 


BE068115 BE068104 BE068102 BE068096 BE068103 BE068154 BE068198 


455708 


1352232.1 


BE069326 BE069290 BE069352 


455732 


1353874.1 


BE080908 BE072258 BE072190 BE072236 


455838 


1374605.1 


BE145808 BE145807 BE181883 


455935 


1384144.1 


BE158687BE158688 


455945 


1385588.1 


BE160636 BE160606 BE160703 


456207 


165078.-1 


AA193450 


456482 


192289.1 


AA485224AA287308AA258121 


458094 . 


47311.1 


AF085325 W72956 W73221 AA2191 12 


458673 


679507.1 


N99626AI302701 
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10 



15 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref. Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. 'Dunham I. et at." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et at, Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition; Indicates nucleotide positions of predicted exons. 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey 


Ref 


Strand 


Ntjjosition 


400555 


9801191 


Minus 


134694-134817 


400608 


9887666 


Minus 


96756-97558 


400610 


9887671 


Minus 


117606-117928,124040-124147 


400925 


7651921 


Plus 


38183-38391,43900-14086 


401045 


8117619 


Plus 


90044-90184,91111-91345 


401049 


7232177 


Plus 


149157-150692 


401093 


8516137 


Minus 


22335-23166 


401256 


9796573 


Minus 


45482-45620 


401283 


9800093 


Minus 


4725647456 


401326 


9212516 


Minus 


226246-227505 


401418 


7452889 


Minus 


124865-125075 


401451 


6634068 


Minus 


119926-121272 


401458 


9187886 


Plus 


76485-77597 


401497 


7381770 


Plus 


92607-92813 


401508 


7534110 


Minus - 


110779-110983 


401575 


7229804 


Minus 


76253-76364 


401747 


9789672 


Minus 


118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097- 








131258,131866-131932,132451-132575,133580-134011 


401781 


7249190 


Minus 


8321^83435,83531-83656,83740^3901,84237-«4393 ( o4955^5037 1 86290-86814 


401785 


7249190 


Minus 


165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942 


401793 


7263888 


Minus 


102945-103083 


401987 


4406829 


Minus 


72893-73021,76938-77049 


402077 


8117414 


Plus 


65014-65195 


402109 


8131678 


Minus 


171722-171859,173197-173303 


402184 


8576001 


Minus 


112844-112986,113505-113636 


402376 


9625329 


Minus 


21753-22385 


402421 


9796341 


Minus 


4660946662,46758-4681136293^86346,8977^89829,90048-90101.102817-102924 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


8174742094 


402696 


7328818 


Minus 


23600-23731 


402765 


9367757 


Plus 


109588-109726 


402800 


6010175 


Plus 


43921-44049,46181-46273 


402820 


6456853 


Minus 


82274-82443 


402892 


8086844 


Minus 


194384-194645 


403133 


7331427 


Plus 


38314-38634 


403356 


8569930 


Plus 


92839-93036 


403388 


9438331 


Plus 


112733-113001,114599-114735 


403426 


9719529 


Minus 


157156-158183 


403585 


8101208 


Minus 


131266-131769 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403637 


8671936 


Minus 


142647-142771,145531-145762 


403639 


8671948 


Plus 


113234-113326,115186-115287.119649-119786 


403677 


7331517 


Minus 


55008-55083,62860-63051 


403775 


7770580 


Minus 


102247-102326,103095-103148 


403943 


7711864 


Plus 


100742-100904,101322-101503 
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404091 7664554 


Minus 


82121-83229 


404097 7770701 


Plus 


55512-55781 


404142 9856692 


Minus 


80316-80459 


404253 9367202 


Minus 


55675-56055 


404274 9885109 


Plus 


104127-104318 


404285 2326514 


Plus 


32282-32416 


404360 9858450 


Minus 


122873-122966,151324-151469,153093-153253 


404440 7528051 


Plus 


80430-81581 


404443 7579073 


Minus 


8719&87441 


404552 7243881 


Plus 


19854-20010 


404561 9795980 


Minus 


69039-70100 


404580 6539738 


Minus 


240588-241589 


404721 9856648 


Minus 


173763-174294 


404826 6572184 


Pius 


4772648046 


404983 4432779 


Minus 


51178-51374,52000-52173 


405037 7543748 


Minus 


127374-127578 


405041 7547195 


Plus 


121230-121714 


405095 8072599 


Plus 


138877-139066 


405153 9965565 


Minus 


175317-175500 


405196 7230083 


Minus 


135716-135851 


405232 7249042 


Plus 


125904-126063 


405248 7259728 


Plus 


637-777 


405336 6094635 


Plus 


33267-33563 


405394 6624123 


Minus 


31900-32373 


405460 7684569 


Minus 


52223-52389 


405494 8050952 


Minus 


70284-70518 


405547 1054740 


Plus 


124361-124520,124914-125050 


405609 5757553 


Minus 


4281443010,4358343783,4486345033,4642946554,4781548018,49961^^ 






52918,55469-55601,571 1 1-57307,58169-58296,60215-60332,61482-61727 


405638 6289229 


Plus 


199260-199372.199826-199929 


405654 4895155 


Minus 


53624^53759 


405718 9795467 


Plus 


113080-113266 


405822 6273498 


Minus 


154660-154974,155203-155379 


405848 7651809 


Minus 


28135-28244 


405873 6758747 


Minus 


32129-32764 


405906 7705124 


Minus 


10835-11059 


405917 7712162 


Minus 


J06829-107213 


405925 6758795 


Pius 


129935-130282 


405953 7960374 


Minus 


65101-65574 


406069 9117732 


Plus 


68880-69374 


406151 7144806 


Minus 


94087-94285 


406153 9929734 


Minus 


12902-13069 


406182 5923650 


Minus 


28256-28935 


406271 7534217 


Plus 


36179-36692 


406291 5686274 


Plus 


9562-9867 


406348 9255985 


Minus 


71754-71944 


406414 9256407 


Plus 


4959349850 


406446 9454509 


Minus 


116424-116527.118721-118859,121187-121364 


406504 7711360 


Minus 


107068-107277 


406554 7711566 


Plus 


106956-107121 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
SURFACE PROTEINS 

Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
tissue level was set to the 85 th percentile value amongst 144 non-malignant tissues, and the 
96 th percentile value amongst the 73 breast cancer specimens was greater than or equal 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization (e.g. ig, fii3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 
20 Pred.ProtDomalns: Predicted Protein Domains 
Unfcene Title: Unigene gene title 

R1 : Ratio of 93 rd percentile of tumor to 85* percentile of normal body tissue 



25 Pkey ExAccn UnlgenelD PredProt.Oomains 
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408591 
400291 
449746 
407277 
400292 
424735 
426878 



407178 
408000 
427585 
429441 
450375 
420931 
422109 
445730 
451110 
400297 
420813 
452744 
424634 
420757 
424399 
447350 
456207 
431448 
427217 
456938 
435496 
402578 
453160 
422505 
444342 



AF015224 Hs.46452 

AA401369 Hs.190721 

AI668594 Hs.176588 

AW170035 Hs.326736 

AA250737 Hs.72472 

U31875 Hs.272499 
BE069341 

NM_000230Hs.194236 

AA195651 Hs.104106 

L11690 Hs.620 

D31152 Hs.179729 

AJ224172 Hs.204096 

AA009647 Hs.8850 

AF044197 Hs.100431 

S73265 Hs,1473 



AI624342 

A1955040 

A1127076 

X51501 

A1267652 



Hs.170042 

Hs.265393 

Hs.334473 

Hs.99949 

Hs.30504 



NNL003613HS.151407 
X78592 Hs.99915 



A1905687 
AI375572 
AA193450 
AL137517 



Hs.2533 
Hs.172634 

Hs.334473 



AA399272 Hs.144341 
X52509 Hs.161640 
AW840171 Hs.265398 

AI263307 Hs.239884 
AL120862 Hs.124165 
NMJ)14398Hs.10887 



SS.Uterogtobin.SS.Uteroglobin 
TM 

,SS,p450 
TM 

deafo.ZUS.TM.Activinjecp.pkinase, 
,SS,TM 
TM 

SS.leptin.SS.Leptin, 
,SS,Dihydroorotase, 

PlectinjepeatSH3,spectrin,SS,P!ectin.j 
SS.CIq.Collagen.SS.CIq.Coflagen. 
.SS.Uteroglobin, 

,SS,TM,disintegrin,Pep_M12B_j)ropep,Repro a disintegrin and metalloproteinase doma 



UnigenoTWe 

mammaglobin 1 
ESTs 

ESTs, Weakly similar to CP4Y.HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-8R 
BMP-R1B 

short-chain alcohol dehydrogenase family 
gb:QV3-8T0381-27010(M)73-c08BT0381 Homo 
leptin (murine obesity homolog) 
ESTs 

bullous pemphigoid antigen 1 (230/240kO) 
collagen, type X, alpha 1 (Schmid metaph 
lipophiiin B (uteroglobin family member) 



SS.IL8.SS 
SS.Bombesin.SS 
,SS,TM,Cation_efllux 
SS 
TM 
SS SS 

,SS,TM,GNS1_SUR4 l cNMPJ)inding,Rlla 
ig.tspJ.SSMA 

hormone_rec,Androgenjecep,zf-C4 l 
SS 

.pkinase, 

.SS.TM.p450.p450 
TM 
SS 



small Inducible cytokine B subfamBy (Cy 



ESTs 

ESTs. Weakly similar to transformation-r 
hypothetical protein DKFZp56401278 
prolactin-induced protein 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 



androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gbzr40eQ7.r1 SoaresJJhHMPu_S1 Homosapl 
hypothetical protein DKFZp56401278 
ESTs 



f SS JM r amtnotran - L2,CadhertrLC - .term,cad tyrosine aminotransferase 

SS ESTs, Weakly similar to transformatiorw 

SS.p450.SS.TM.p450 C1001134:gi|2117372|pir||l65981 fatty ac 

SS H2B histone family, member L 

SS programmed cell death 9 (PDCD9) 

Lamp.SS.TM.Ump, similar to lysosome-associated membrane 
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R1 

168.6 

73.2 

65.7 

57.6 

55.9 

53.8 

50.3 

40.8 

39.3 

37.3 

35.2 

30.0 

25.7 

25.2 

24.8 

24.1 

24.0 

23.8 

22.8 

22.6 

21.7 

21.7 

20.3 

19.2 

18.3 

18.2 

18.2 

18.1 

17.9 

17.8 

17.8 

17.7 

17.5 



WO 02/059377 



PCT/KS02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



449765 
428227 
425692 
424001 
448595 
449448 
418007 
418994 
453596 
452401 
446591 
419296 
452838 
422805 
448390 
447342 
411869 
443709 
459587 
442580 
400289 
411598 
415263 
433805 
407276 
443348 
421037 
424086 
400295 
452461 
427365 
433365 



N92293 Hs.206832 
AA321649 Hs.2248 
D90041 Hs.155956 
W67883 Hs.1 37476 
AB014544 Hs.21572 



D60730 
M13509 



Hs.57471 
Hs.83169 



AA296520 Hs.89546 
AA441838 Hs.62905 
NMJ)07115Hs.29352 
844186 Hs.15456 
AA236115 Hs.1 20785 
U65011 Hs.30743 
AA436989 Hs.121017 
AL035414 Hs.21068 



Hs.19322 
Hs.23439 
Hs.134662 

Hs.130239 
Hs.2258 
BE336654 Hs.70937 
AA94B033 Hs.1 30853 
AA706910 Hs.1 12742 
AI951118 Hs.326736 
AW873596 Hs.182278 
AI684808 Hs.197653 



A1199268 

W20027 

AI082692 

AA031956 

AI733682 

X07820 



AI351010 
W72838 
N78223 
A1873274 



Hs.102267 
Hs.2533 
Hs.108106 
Hs.190721 



432596 
408771 
432912 
447033 
421155 
424905 
425398 
438167 
459583 
423945 
439820 
402606 
445263 
430217 
447164 
431385 
423887 
415385 
425704 
429859 
425523 
428368 
418912 
422026 
451952 
438199 
400608 
413472 
432374 
402408 
445537 
451621 
405654 
434988 



AF026944 Hs.293797 

AA576953 Hs.22972 

AJ224741 Hs.278461 

AW732573 Hs.475B4 

BE007371 Hs.200313 

AI357412 Hs.157601 

H87879 Hs.102267 
NM_002497Hs.1537O4 

AL049689 Hs.1 56369 

R28363 Hs.24286 
AI907673 
AA4 10943 

AL360204 Hs.283853 

H57646 Hs.42586 
N47863 Hs.336901 
AF026941 Hs.17518 
BE178536 Hs.1 1090 
AL080207 Hs.134585 
R17798 Hs.7535 
U79293 Hs.159264 
NM_007050Hs.225952 
AB007948 Hs.158244 
BE440042 Hs.83326 
NMJX)0685Hs.89472 
U80736 Hs.1 10826 
AL120173 Hs.301663 
AW016531 Hs.122147 

BE242870 Hs.75379 

W68815 Hs.301885 
NA 

AJ245671 Hs.12844 

AJ879148 Hs.26770 
NA 

AM18055 Hs.161160 



SS.IL8, 

.SSAcetyltransG, 
.pkinase, 

LRRCT < LRR,SS,LRRCT,serine_carbpept 
SS 



ESTs, Moderately similar to ALU8.HUMAN A 
small inducible cytokine subfamily B {CX 
N-acety [transferase 1 (aryiamine N-acety 
10 



K1AA0644 gene product 
ESTs 



SS.hernopexinpeptidase_M10,SS l Pepti(iase_rnatrix metalloprotsinase 1 (MMP1; inters 
SSJectinj^shl,EGF,SS,EGF Jeciin_c.su selecfin E (endothelial adhesion molecul 



SS 

.SS.CUB.Xlink, 

PDZ.SS 

SS 

SS.SS 

histone.SS.histone.histone 
SS 

.SS.Iipocalin 

.SSPeptldase.MI, 

.SS.TM.SNF 

,ssm 

SS 



hypothetical protein RJ 14834 
tumor necrosis factor, alpha-induced pro 
PDZ domain containing 1 
ESTs 

preferentially expressed antigen in mela 
H2A histone family, member A 
hypothetical protein 

Homo sapiens, Similar to RJKEN cDNA 2010 

ESTs 

ESTs 

gb:zk15e04.s1 Soares_pregnantuterus_NbH 
ESTs 



hemopexin,PeptidaseJM0,SS,PeptfdaseJrf10matrix metalloproteinase 10 (MMP10; str 



histone.SS.histone.histone 
,SS t htstone,histone,linker_histone 
.SS.RIbosomalJJAe, 
TM 

.SS.DENN 
SS 

.SS.LysyLoxIdase 
SS 

.SS^a^HD, 
TM 

.SS.TPR 

SS I TM,UPF0016,SS,TW,UPF0016 

SS.EGF.vwa.SS.TM.vwa, 

TMK.tetra,lorLtrans 4 SS f TM,KJetra,ionJ 

.SSJM.Folatejarrier 

SS 



H3 histone family, member A 
ESTs 
ESTs 

Homo sapiens breast cancer antigen NY-BR 
calmodulin 2 (phosphoryiase kinase, delt 
programmed cell death 9 (PDCD9) 
lysyl oxidase 

aldehyde dehydrogenase 9 family, member 

transcription factor 

ESTs 

ESTs 

hypotheSca! protein FLJ 13352 
matriEn3 

potassium voltage-gated channel, delayed 

ESTs 

ESTs 



SS,LysyLoxidase,AWose_epim l Ep!merase,S tysyl oxidase 



pklnase.SS.TM.pkinase.poryprenyl.synt, 
SS 

.SS.WtmJ.p^SO.rrm 



death J ZU5,TMAcSvin_recp,pkinase, 

SS 

SS 

,SS.AcyItransferase, 



N1MA (never in mitosis gene a)-/elated k 
hypothetical protein similar to tenasrin 
ESTs 

gb:IL-BT152-080399-004 BT152 Homo sapien 
gb:zt32h03j1 Soares ovary tumor NbHOT H 
Homo sapiens mRNA full length insert cON 
NM_024626:Homo sapiens hypothetical prot 
K1AA1560 protein 



,SS,RrM4ioLA.RrMj)oLA2,RibcisomaLS24e,ribosomal protein S24 



,TM,IBR 
,SS,TM 

.SS.TM.BRCT.ank^BCjran.ABCJran 

,SS,Fork_head, 

SS 

,SS,TM,Y4)hosphatase < MAM,fn3. 
l SS,laminirLB,larrunin_EGF t laminm_Nfcerm 



Homo sapiens cig5 mRNA, partial sequence 
membrane-spanning 4-dornains, subfamily A 
DKFZP434G232 protein 
COBW-nke protein 

Human clone 23948 mRNA sequence 
protein tyrosine phosphatase, receptor t 
KIAA0479 protein 



SS l Peptidase_M10,hemopexin,SS,PeptIdase_matr!x metalloproteinase 3 (stromelysin 



SS,TM,7tm - 1,SS,TM l 7tm_1, 
SS 

,SS,pkinase, 
.SS.ArfGap, 
SS.TM.SS.TM 
SS 
SS 

,SS,carb_anhydrase 
,SS,TM,ras 
SS.Bpocalin.Bpocafin, 
BTB.SS 



angiotensin receptor 1 
trinucleotide repeat containing 9 
ESTs 
ESTs 

C10001899:gi|7508633|plr|}T25392 hypothe 
solute carrier family 1 (glial high afti 
Homo sapiens cDNA FU1 1346 fts, done PL 
NM_03G920*:Homo sapiens hypothetical pro 
EGF-fike-ckimain, multiple 6 (EGFL6) 
fatty acid binding protein 7, brain 
C12001521giJ7513934|pirp31081 cca3 pr 
ESTs 



17.3 

17.0 

16.7 

16.5 

16.3 

16.2 

15J 

15.5 

15.5 

15.0 

14.9 

14.8 

14.7 

14.3 

14.2 

14.2 

13.9 

13.7 

13.7 

13.5 

13.5 

13.3 

13.2 

13.1 

13.1 

13.0 

12.9 

12.8 

12.7 

12.5 

12.4 

12.3 

12.0 

11.9 

11.9 

11.9 

11.8 

11.8 

11.7 

11.6 

11.5 

11.5 

11.4 

11.4 

11.3 

11.2 

11.1 

11.1 

11.1 

10.9 

10.9 

10.7 

10.4 

10.3 

10.3 

10.3 

10.3 

10.3 

10.2 

10.1 

10.0 

9.9 

9.8 

9.7 

9.6 

9.6 

9.6 
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416220 N49776 Hs.1 70994 


,SS,TM 


hypothetical protein MGC10946 


y.o 


431808 M30703 Hs.270833 


SS.TM,EGF,SS 


amphiregulin (schwannoma-derived growth 


9.5 


4U142 AW368397 Hs.150042 


.SS.UDPGT 


Homo sapiens cONA FU14438 fis. done HE 


O A 

9.4 


400298 AA032279 Hs.61635 


TM 


six transmembrane epithelial antigen of 


9.4 


418601 AA279490 Hs.86368 


SS.TM.calreticulin.SS.TM.calreticuIin, 


calmegin 


9.4 


415539 AI733881 Hs.72472 


death,ZU5JM^ctlvtn_recp,pkinase, 


BMP-RIB 


ft A 

9.4 


421451 AA291377 Hs.50831 


TM 


ESTs 


9.3 


429432 AI678059 Hs.202676 


SS 


synaptonemal complex protein 2 


9.3 


442441 AI820662 Hs.1 29598 


ss 


ESTs 


9.1 


426429 X73114 Hs.169849 


,SS,TM,fn3,ig, 


myosin-blnding protein C, slow-type 


3.1 


406687 M31126 Hs.272620 


SS.Pepttdase^MIO.hemopexin.SS.PepUdase.matrixmetalloproteinase 11 {MMP11;stro 


9.1 


446693 AW004854 Hs.228320 


SS 


hypothetical protein FLJ23537 


9.1 


419948 AB041035 Hs.93847 


Ferricjedud,TM,Ferric reduct, 


MM 016931:Homo sapiens NADPH oxidase 4 ( 


9.1 


426214 H59846 Hs, 128355 


SS 


ESTs, Moderately similar to AW7JHUMAN A 


9.0 


427718 AI798680 Hs.25933 


t SS,TM,histone,Sec1 ,histone,sugarjr 


ESTs 


8.9 


414812 X72755 Hs.77367 


SSMSSJL8 


monokine induced by gamma Interferon 


8.8 


400285 NA 


JM,ABC_tran,ABC..mernbrane l 


Eos Control 


8.8 


422330 D30783 Hs.115263 


SS.TM.EGF.SS.TM 


epireguRn 


8.8 


416182 NM_004354Ha.79069 


cycfin,SS 


cycfinG2 


q a 

0.0 


420077 AW512260 Hs.87767 


SS 


ESTs 


8.7 


452281 T93500 Hs.28792 


,SS,TGF-beta,TGFbj)ropeptide l 


Homo sapiens cDNA FU11041 fis. done PL 


8.7 


434531 AA642007 Hs.116369 


SS 


ESTs 


8.6 


408380 AF123050 Hs.44532 


,SS,TM,ub!quilin 1 7tm_3,ANF_receptor,sush 


diubiquitin 


8.6 


443788 AI732643 Hs.144151 


TM 


ESTs 


8.6 


411078 AI222020 Hs.182364 


SS.SS 


CocoaCrisp 


8.5 


445495 BE622641 Hs.38489 


SS,SS,ENTH ( LLWEQ,ENTH,LLWEQ ( DNAjTiis_reESTs ( Weakly similar to 138022 hypotheti 


8.5 


433426 H69125 Hs.133525 


.SS.TM 


ESTs 


8.5 


424871 NM_004525Hs.153595 


SS,EGF.IdLrecepta,ldLrecepLb,SS,TM t E 


low density tipoprotein-retated protein 


8.4 


426215 AW963419 Hs.155223 


SS 


stanniocalcln 2 


8.4 


409045 AA635062 Hs.50094 


TM 


Homo sapiens mRNA; cONA DKFZp434O0515 (f 


8.4 


435525 AI831297 Hs.123310 


TM 


ESTs 


8.3 


409203 AA780473 Hs.687 


SS.p450.SS.p450 


cytochrome P45Q, subfamily 1VB. polypept 


8.3 


424902 NM_003866Hs.1 53687 


SS.SS 


Inositol polyphosphate-4-phosphatase. ty 


0 1 
0.0 


431725 X65724 Hs.2839 


SS.CysJnot.SS 


Nonie disease (pseudogGoma) 


8.3 


418092 R45154 .Hs.106604 


,deatrUU5,r^nase,Activin_recp 1 


ESTs 


0 0 

0.0 


439840 AW449211 Hs.105445 


SS 


GDNF family receptor alpha 1 


8.2 


427811 M81057 Hs.180884 


SS l 2a_carbOpept,Propep_M14,SS,Propep_M14cart)oxypepfidase 81 (tissue) 


8.2 


420807 AA280627 Hs.57846 


SS.cpnIO 


ESTs 


0.2 


426320 W47595 Hs.1693Q0 


SS,TGF-beta,TGFbj>ropeptide,SS ■ 


transforming growth factor, beta 2 


0 0 

0.2 


447078 AW885727 Hs.301570 


.SS.kazal, 


ESTs 


Q 4 

0. 1 


415786 AW419196 Hs.257924 


SS 


hypothetical protein FU13782 


0.1 


410102 AW248508 Hs.279727 


SS 


Homo sapiens cDNA FU14035 fis, done HE 


Q ft 

o.U 


404347 


SS 


Target Exon 


8.0 


433687 AA743991 


TM 


gb;ny57g01.s1 NCLCGAP_Pr18 Homo sapiens 


8.0 


421373 AA808229 Hs.167771 


t SS,IMPDH.C.IMPDHJl ( CBS 


ESTs 


8.0 


422634 NMJ)16010Hs.1 18821 


SS 


CGI-62 protein 


7.9 


453310 X70697 Hs.553 


TM,SNF,SS.TM,SNF. 


solute carrier family 6 (neurotransmitte 


7.9 


435957 N39Q15 Hs.190368 


,SS,TM 


ESTs 


7.8 


407771 AL138272 Hs.62713 


( TM l cpn60JTCP1 ) Sema, 


ESTs 


7 R 
f .0 


443646 AI085198 Hs.164226 


.TSPN.vwc.tspJ.EGF.thiored, 


ESTs 


7.8 


446142 AI754693 Hs.145968 


,TM,cadterin,Cadherin_CJerm. 


ESTs 


7.7 


444649 AW207523 Hs.197628 


jSS.rrm, 


ESTs 


7.6 


435147 AL133731 Hs.4774 


.TM.SDF.UPARJ.Y6, 


Homo sapfens mRNA; cONA DKFZp761C1712 (f 


7.6 


439138 AI742605 Hs.193696 


TM 


ESTs 


7.6 


429220 AW207206 Hs.136319 


SS 


ESTs 


7.6 


428804 AK000713 Hs.193736 


.SS.UDPGT 


hypoftetical protein FU20706 


7.5 


453511 AL031224 Hs.33102 


SS.SS 


transcription fador AP-2 beta (adivati 


7.5 


439809 R41396 Hs.101774 


SS 


hypothetical protein FU23045 


7.5 


414869 AA157291 Hs.21479 


SS 


ublnudein 1 


7.5 


416276 U41060 Hs.79136 


SS.TMJM 


UV-1 protein, estrogen regulated 


7.5 


452862 AW378065 Hs.8687 


,SS,Pep_M12B_pfopep > Reprolysin,tspJ l 


ESTs 


7.4 


452926 AI742170 Hs.31297 


.SS.TM 


duodenal cytochrome b 


7.4 


453331 A1240665 Hs.8895 


,SS,TM l dislntegrinPepJul12Bj}ropep,Repro ESTs 


7.3 


420802 U22376 Hs.1334 


SS,NA,mybJ)NA-btndlng 


v-myb avian myeloblastosis viral oncogen 


7.3 


450603 R43646 Hs.12422 


SS 


ESTs 


7.2 


422867 132137 Hs.1584 


SS,EGF.tspJ.SS.E2F_TDP. 


cartilage oligomeric matrix protein (COM 


7.2 


418004 U37519 Hs.87539 


SS.aldedh.SS.aldedh, 


aldehyde dehydrogenase 3 family, member 


7.2 
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426451 AI908165 Hs.169946 


SS.GATA, 


GATA-blndlng protein 3 (T-cell receptor 


7.1 


450701 H39960 Hs.288467 


.SS.LRR 


Homo sapiens cDNA F1J12280 fc. done MA 


7.1 


419519 AI198719 Hs.176376 


SS 


ESTs 


7.1 


410555 U92649 Hs.64311 


,TM,disintegrin,Reprolysin, 


a dlsintegrin and metalloproteinase doma 


7.1 


433138 AB029495 Hs.59729 


SS.ig.Serna.SS.Sema.efhand 


sernaphorin sern2 


7.0 


411558 AA102570 Hs.70725 


SS.TM.SSJM 


gamrre-aminobutyric acid (GABA) A recepto 


7.0 


409079 W87707 Hs.82065 


,TMM 


(nterieukin 6 signal transducer (gp130, 


7.0 


417275 X63578 Hs.295449 


SS.emand.SS.efhand.ras 


parvalbumin 


7.0 


432731 R31178 Hs.287820 


,SS.fo3.m1 ) fn2 l fn2,fn1 


fibronectin 1 


6.9 


442818 AK001741 Hs.8739 


WD40.SS 


hypothetical protein RJ 10879 


6.9 


407366 AF026942 


,TM,iBR 


gb:Homo sapiens cig33 mRNA, partial sequ 


6.8 


427427 AF077345 Hs.177936 


SS.lectin_c.SS 


ESTs 


6.8 


410785 AW803341 


SS 


gb:ltf -UM007949030M50^3 UM0079 Homo 


6.7 


401045 


lCEj>20,S^,lCEj3l0JCE_p20.ICEj)10.ICE_p 




C11001883*:gi|6753278|refJNPJ33938.1| c 


6.7 




4(0900 r\\ IZjjOj Uo-OUiJO 


,SS.Reproiysin f tsp 1, 


ESTs 


6.7 


d49ftR9 R41H91 Hs7A'1'l 
HHCSJOC rsH I OZ J no./ «J 1 0 


,mEPHJb<pkinase,SAM,fn3. 


ESTs; cabyntenin-2 


6.7 


449RR1 AA9d1AT7 H<:R77ft7 


SS 


ESTs 


6.6 


AlfiftlR A1RRRZQQ Hq 1R1719 
** I GO JO HIOOO**ao ria. IVJI/IZ 


,TM,Activin_recp f pk]nase t deathZU5, 


ESTs 


6.6 


d990Rn D9nRC« He ^9Rft9^ 


,SS,TM,CD36 


ESTs. Moderately similar to AIU5JWMAN A 


6.6 


***HOOI DcJoYjJo nS-Zw/IO 


,SS,mito_carr 


ESTs. Weakly similar to S64054 hypothec* 


6.6 




JMJtmJANF receptor, 


TaigetExon 


6.6 


4 1/000 nVVUO/yUO n5,OZ7/£ 


SS,Co!lagen,COLFI,TSPN,SS,TSPN 


collagen, type XI, alpha 1 


6.6 


/19flQ4Q At IICCW He iQIdiA 


SS.SS 


KIAA0575gene product 


6.5 


*r lui/ 3 UOODOO nb.D I / SO 


t SS,Ribosoma_S4e 


transcription factor AP-2 gamma (activat 


6.4 


4z£>AJb AWUo/oUU rlS. ibaZZJ 


SS 


stanniocaldn 2 


6.2 


H 1 OOO? IN lYi_wOV£Ona« 1 GOO? 


.SS.serpin, 


serine (or cysteine) proteinase inhiblto 


6.2 


4iooiy /UoioOui ns./yiy/ 


SS.TMjg.SS.TM 


CD83 antigen (activated B lymphocytes, i 


6.2 


AlHAft AA910RQ1 Uc 77R9R 


,SS,kinesin, 


RAB6 interacting. kinesin-fike (rabkines 


6.2 


X/OOXO AtAHC7fk07 Up 401CC9 


.SSJg.Sema.pkinase, 


ESTs 


6.2 


•WolW AAUzoooU nS.ZOZOZ 


.SS.TM.fn3, 


prolactin receptor 


6.1 


AAlAR*) TvlGGCI Lit- OflOQ 

44oioz i*iyyoi ns.yu^y 


filament,SS,filament,filament 


OKFZP434G032 protein 


6.1 


409 B02 W26713 Hs 256972 


.SS.TM.DAGKa.DAGKc, 


ESTs 


6.1 


428479 Y00272 Hs.184572 


,SS,pkinase,pkinase 


cell division cycle 2. G1 to S and G2 to 


6.1 


400300 X03363 


,SS,THpkinase,RecepJ^domain,SH2 t PH,FurHER2 receptor tyrosine kinase (oerb-b2, 


6.1 


433404 T32982 Hs.102720 


SS 


ESTs 


6.1 


410079 U94362 Hs.58589 


GlycoJransLS.SS 


glycogenh 2 


6.1 


401781 


,SS,lflarrBntPn^osyltran f fflamentArmad 


TaigetExon 


6.1 


447359 NM.012093HS.18268 


SS.adenylatekinase, 


adenylate kinase 5 


6.1 


402230 IMA 


.SS.TM^SO, 


Target Exon 


6.1 


427674 NM 003528Hs.2178 


histone.SS.histone, 


H2B histone family, member Q 


6.1 


428398 AI249368 Hs.98558 


.SS.TM 


ESTs 


6.0 


458098 BE550224 Hs.74170 


SS 


metattothionein 1E (functional) 


6.0 


419968 X04430 Hs.93913 


SS.IL6.IL6, 


interieukin 6 (interferon, beta 2) 


6.0 


416636 N32536 Hs.42645 


,SS,TM 


solute carrier family 16 (monocarboxylic 


6.0 


419703 AI793257 Hs.128151 


,SS,zf-C2H2. 


ESTs 


5.8 


424687 J05070 Ws.\^m 


SS.Peptldase M10,fn2,hemopexin,SS l TM,P 


epmatnx metalloproteinase 9 (geJatirtase B 


5.8 


449679 AI823951 Hs.129700 


SS 


toOold-like 1 


5.8 


421296 NWL002666HS.103253 


SS 


periDpin 


5.8 


442117 AW664964 Hs.128899 


.SS.TM 


ESTs 


5.7 


400303 AA242758 Hs.79136 


,SS,TM 


UV-1 protein, estrogen regulated 


57 


419440 AB020689 Hs.90419 


SS 


WAA0882 protein 


5.7 


444858 AJ199738 Hs.208275 


SS 


ESTs, Weakly similar to ALUA-HUMAN llll 


5.7 


432239 X81334 Hs.2936 


SS.Peptidase M10,hemopexin,SS,Peptidas 


e^matrix metalloproteinase 13 (collagenase 


5.6 


440705 AA904244 Hs.153205 


TM 


ESTs 


5.6 


400286 NA 


SSJMABCJran.ABCjnembrane.SS 


C16000922:gi|7499103|pir||T20903 hypolhe 


5.6 


446466 H38026 Hs.308 


amesfin,SS 


arrestin 3, retina! (X-arrestin) 


5.5 


423201 NM.000163HS.125180 


SS.TM.fn3.SS 


growth hormone receptor 


5.5 


433043 W57554 Hs.125019 


SS 


lymphoid nuclear protein (LAF-4) mRNA 


5.5 


439509 AF086332 Hs.58314 


.SS.TM.Syntaxin 


ESTs 


5.4 


425247 NM_005940Hs.155324 


SS,Pepfidase„M10.hernopexin,SS 


matrix metalloproteinase 11 (MMP11;stro 


5.4 


409757 NMj)01898Hs.123114 


.SS.cystalin, 


cystatinSN 


5.4 


425292 NMJ)05824Hs,155545 


SS 


37 kOa leucine-rich repeat (LRR) protein 


5.4 


448045 AJ297436 Hs.20166 


.SS.TM 


prostate stem cell antigen 


5.4 


452681 AF153330 Hs.30246 


.SS.TM 


solute carrier family 19 (thiamine trans 


5.3 


452243 AL355715 Hs.28555 


SS 


programmed cell death 9 


5.3 


439310 AFOB6120 Hs.102793 


.SS.TM.UDPGT.caseinkappa 


ESTs 


5.2 
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441111 AI806867 Hs.126594 
452355 N54926 Hs.29202 
427711 M31659 Hs.180408 
418636 AW749855 
5 429353 AL1 17406 Hs.200102 
441690 R6I733 Hs.33106 
430447 W17064 Hs.332848 
429698 A1685086 Hs.26339 
425325 X52730 Hs.1892 
1 0 phenytethanotamine N-methyltransferase 

423600 AI633559 Hs.310359 SS 
414737 A1160386 Hs.125087 
403593 NA 

407758 D50915 Hs.38365 
15 445234 AW137636 Hs,146Q59 

411165 NM_000169Hs.69089 

420633 NMJ)14581Hs.274480 

414117 W88559 Hs.1787 

416783 AA206186 Hs.79889 
20 401093 

411096 U80034 Hs.68583 

457411 AW085961 Hs.130093 

436007 AI247716 Hs,232168 

450506 NWL00446OHS.418 
25 417975 AA641836 Hs.30085 

421072 AI215069 Hs.89113 

427032 AF012023 Hs. 173274 

447752 M73700 Hs.105938 

403199 NA 
30 427122 AW057736 Hs.323910 

445900 AF070526 Hs.13429 

413048 M93221 Hs.75182 

419563 AA526235 Hs.193162 

442432 BE093589 Hs.38178 
35 452093 AA447453 Hs.27860 

442323 AW016669 Hs.29190 

450606 AI668605 Hs.60380 

435542 AA687376 Hs.269533 

417576 AA339449 Hs.82285 
40 446089 AI860021 Hs.270651 

445413 AA151342 Hs.12677 

424420 BE614743 Hs.146688 
• 432378 A1493046 Hs.146133 

452190 H26735 Hs.91668 
45 434674 AA831879 Hs.136985 

419966 AI345455 Hs.78915 

421582 A1910275 Hs.14Q6 

410361 BE391804 Hs.62661 

426327 W03242 Hs.44898 
50 406639 M97711 

452834 AI638627 Hs.105685 

427315 AA179949 Hs.175563 

446733 AA863360 Hs.26040 

442118 AA976718 Hs.202242 
55 421524 AA312082 Hs.105445 

453060 AW294092 Hs.21594 

453403 BE466639 Hs.61779 

444301 AK000136 Hs.10760 

453619 H87648 Hs.33922 
60 432656 NM.000246Hs.3076 

426384 AI472078 Hs.303662 

431701 AW935490 Hs.14658 

416931 D45371 Hs.80485 

420854 AW296927 
65 418867 D31771 Hs.89404 

443514 BE464288 Hs.141937 

447499 AW262580 Hs.147674 



ESTs 

G protein-coupled receptor 34 
solute carrier family 25 (mitochondria] 
gb:QV4-BT0534-281299-053-c05 BT0534 Homo 
ATP-bindlng cassette transporter MRP8 
ESTs 

SWI/SNF related, matrix associated, acti 
ESTs, WeaMy slmflar to S21348 probable 



.SS.TM.Phosphodiest, 
TM,7fmJ,TM 
SS 

.SS.TM.HECT 

,SS,TM.ABCJran f ABC__membrane, 
> SS l HECT f zf-UBR1.PABP,144-3, 
SS 

,SS,ras, 

SS,NNMT_PNMT_TEMT,SS^NNfr,PNMTJEMT,STAR 

5.1 
ESTs 

SS ESTs 
,C(DE-N,pkinase Target Exon 

SS.SS KIAA0125 gene product 

.SSJM ESTs 
SS ( Melibiase,BTK t PH,pkinase l SH2 l SH3,Ribo gatactosidase, alpha 
SS.Gpocalin.SS.lipocaiin odorant-binding protein 2A 

,TM,ion_trans,K_tetra, proteollpld protein 1 (Pellzaeus-Merzbac 

SSJM.TM monocyte to macrophage differentiations 

TM.LRRCTJM.LRRCT, C12000586*:gI]6330167|db]|BAA86477.1| (A 

Peptidase J/I3, mitochondrial intermediate peptidase 

SS ESTs 
.SS.adfuzinc, ESTs 
SS,DPPlV_N_term > Peptidase_S9,SS 1 OPPIV_NJbroblastactivayon protein, alpha 
.SS.trypsin hypothetical protein FU23186 

SS ESTs 

,SS, 14-3-3 integrin cytoplasmic domain-associated p 

SS,transferrin,7tm_1 .transferrin, lactotransferrin 
SS,TM ( Folate_carrier,SS,TM,Fo!ate_carrie NMJ)25243*:Homo sapiens solute carrier f 
,SSJM,pWnase,RecepJ^domain 1 SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 



.SS.Ca.channeLB, 

SSTMJedn_c 1 fn2,Ricin - BJectin ( SS.TM 
SS 
SS 

,SS.TM,7tmJ. 
,SS,TM,CBS,voltage_CLC 
,TM,GlycoJiydroJ 
,SS,pkinase,RrioGEFJg,PH,SH3, 



Homo sapiens done 24787 mRNA sequence 
mannose receptor, C type 1 
Homo sapiens cDNA FU1 1983 fis, clone HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA OKFZp586M0723 (1 
ESTs 

ESTs, Moderately similar to ALU6.HUMAN A 
ESTs 



AIRS.formyLtransf.GARS.SS.GARS.AlRS.for phosphoribosylglycinamide formyltransfer 

.pkinase ESTs, Moderately similar to A47582 B-cet 

SS.UPF0099.SS.UPF0099. CGU47 protein 

.SS.TM.MAPEG, prostaglandin E synthase 

,SS,TM,UDPGT ESTs 

1 SS,TM,PH,SH2,FurinTike < pkinase,Recepi. Homo sapiens clone PP1498 unknown mRNA 

,SS ( Histdeacetyl 4 ESTs 

pkinase.OPR, GA-binding protein transcription factor, 

SS,trefo3,SS,TM,tdLrecepLa,SRCR,tryps trefoil factor 1 (pS2) 



SSJMGBP.TM.GBP 
SS 

SS,SS,ig, 

.SSPEAD.ForkJiead 
SS 

,SS.TM,p450, 
.tg.Sema. 
SS 

.SS.ras.Yjjhosphatase.ras 
,SS,HMG_box,filament, 
SS.LRR.SS 
SS 

SS.LRR, 
.SSArfGap, 
.SS.BIR 

SS,C1q,Collagen,SS,C1q, 
,SS,TM,Peptidase_M1, 
SS,homeobox,homeobox, 
,SS,TM,MiP. 

,SS,TM.cadherin,cadherin 



guanytate binding protein 1, interferon- 
Homo sapiens clone TCCCTA00151 mRNA sequ 
gb:Human T-cell receptor (V beta 18.1, J 
WAA1688 protein 

Homo sapiens mRNA; cDNA DKF2p564N0763 (f 
ESTs, Weakly similar to fatty acid omega 
ESTs 

GDNF family receptor alpha 1 

hypothetical protein MGC15754 

Homo sapiens cDNA FU13591 fis, clone PL 

asporln(LRRdassl) 

Homo sapiens, done MGO9084, mRNA, comp 

MHC dass II transactivator 

ESTs 

Human chromosome 5q13.1 done 5G8 mRNA 
adipose most abundant gene transcript 1 
gbUl-H-BW0-ajc^07-04Jl.s1 NCLCGAP_Su 
msh (Orosophila) homeo box homobg 2 
ESTs 

protocadherlnbeta16 



5.2 
5.2 
5.2 
5.2 
5.1 
5.1 
5.1 
5.1 



5.1 
5.1 
5.1 
5.0 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
4.8 
4.8 
4.8 
4.7 
4.7 
4.7 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5 
4.5 
4.5 
4.5 
45 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



441560 
409064 
422667 
454032 
432663 
401747 
432882 
437036 
447754 
443194 
451871 
457211 
421566 
431657 
427899 
444779 
442295 
436396 
446039 
422938 
406922 
439285 
424800 
429922 
447178 
409038 
452747 
420139 
408877 
403329 
439926 
430832 
432481 
452410 
416661 
431958 
425071 
447197 
428722 
428330 
423242 
449048 
414831 
413589 
408875 
418629 
450787 
414870 
450325 
407633 
426172 
442262 
427961 
445563 
403943 
408761 
423279 
420440 
445107 
428303 
411667 
427809 
416203 
430376 
444190 
433495 
429638 



F13386 Hs.7888 
AA062954 Hs.141883 
H25642 Hs.133471 
W31790 Hs.194293 
AI984317 Hs.122589 

NM_013257Hs279696 
A1571514 Hs.133022 
AW073310 Hs.163533 
A1954968 Hs.279009 
A1821005 Hs.1 18599 
AW972565 Hs.32399 
NMJM)0399Hs.1395 
AB45227 Hs.1 0544 8 
AA829286 Hs.332053 
AI192105 Hs.147170 
A1827248 Hs^24398 
A1683487 Hs.152213 
AI150491 Hs.90756 
NM_001809Hs.1594 
S70284 

AL133916 Hs.172572 
AL035588 Hs.153203 
Z97630 Hs.226117 
AW594641 Hs.192417 
T97490 Hs.50002 
BE153855 Hs.61460 
NM_005357Hs.95351 
AA479033 Hs.130315 
NA 

AW014875 Hs.137007 
AI073913 Hs.100686 
AW451645 Hs.151504 
AL133619 Hs.29383 
NM_001949Hs.1189 
X63629 Hs.ffl7 
NMJ)13939Hs.154424 
R36075 

U76456 Hs.190787 
L22524 Hs.2256 
AL039402 Hs.125763 
Z45051 Hs.22920 
M31158 Hs.77439 
AW452631 Hs.313803 
NM_015434Hs.486O4 
BE247550 Hs.86859 
AB006190 Hs.25475 
N72264 Hs.300670 
AI935962 Hs.26289 
NM_007069Hs.37189 
AA371307 Hs.125056 
BE170651 Hs.8700 
AW293165 Hs.143134 
AW8736Q6 Hs.149006 

AA057264 Hs.238936 
AW959861 Hs290943 
NM.002407HS.97644 
AI208121 Hs.147313 
AW974476 Hs.183601 
BE160198 

M26380 Hs.180878 
X54942 Hs.83758 
AW292053 Hs.12532 
AI878918 Hs.10526 
AW373784 Hs.71 . 
AI916662 Hs.211577 



.pkinase, 

.SS.CUB, 

.SS.TM.FMO-fike 

,SS,TM 

TM 

.SS.filamentfilament 
pkinase,pkinase_C, 
,SS,TM 
.pkinase, 
.SS.TM 
.SS.GDNF, 
WH1.WH1 
zf-C2H2,SS 
.SS.TM.pkinase 



1 SS.SAA_proteins f ABC_m6mbrane J ABC_tran f 3enjm amyloid A1 



Homo sapiens clone 23736 mRNA sequence 

ESTs 

ESTs 

ESTs, Weakly similar to I54374 gene NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
senjm/glucocorticoid regulated kinase-li 
ESTs 

Homo sapiens cONA RJ14142 lis, done MA 

matrix Gla protein 

ESTs 

ESTs. Weakly similar to S51797 vasodilat 
early growth response 2 (Krox-20 (Drosop 
ESTs. Weakly similar to B34087 hypothefl 



SS 

.COLFI.vwc.Collagen, 
,wnt 

,TM,GlycoJiydroJ 
,SS,TM,thiolase, 
SS.TM.Desaturase.SS 
.SS.ig.ptdnase.LRRNT.LRRCT, 
HLHSS 

.SS f TM,linkerJiistone,7trrLl 
.SS.TM 
SSJL8,SS,iL8 
.SS.HLH 
.SSJM.P450, 
.SS.TM 
SS.SS 
SS 
SS 

.SS.Coltagen.COLFl.TSPN. 
,SS,TM,ras 
SS 



ESTs 



Homo sapiens cDNA FLJ11469 fts, done HE 
wingless-type MMTV integration site farrd 
ESTs 

centromere protein A (17kD) 
gbstearoyl-CoA desaturase {human, adlpo 
hypothetical protein FU20093 
MyoD family inhibitor 
H1 hlstone family, member 0 
ESTs 

small inducible cytokine subfamily A (Cy 
fg superfamUy receptor LNIR 
lipase, hormone-sensitive 
ESTs. Weakly similar to A47582 B-cell gr 
Target Exon 
ESTs 

ESTs. Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FU1 1973 fis, clone HE 
Homo sapiens mRNA; cDNA OKFZp434E2321 (f 
E2F transcription factor 3 
SSJM,C^dherin_C - term t cadherin 4 SS,TM,cadcadherin 3, type 1, P^cadherin (placenta 
SS,T4_delodinase,T4_deiodinase, deiodinase, iodothyronine, type II 

JM.SDF, gb:yh88b01 .s1 Soares placenta Nb2HP Homo 

.SS.TIMP, tissue Inhibitor of metalloproteinase 4 

SS.Peptidase M10,SS,Peptidase_M10,hemopematrix metalloproteinase 7 (MMP7; uterin 
SS DEME-6 protein 

SS.SS.TM similar to S68401 (cattle) glucose indue 

,SS,cNMPJ)inaTng,R]|a r HMG_box protein kinase, cAMP-dependent, reguJato 

.SS.abhydrolase ESTs, Highly similar to AF157833 1 nond 

SS DKFZP434B168 protein 

SS,SH2.PH,SS,TM f PH.SH2f urln-like,pkinas growth factor receptortwund protein 7 



SS.TM.MlP.SS.TM.MiP, 
SS 
SS 

TM.TM 
.SS.DENN 
.SS.START, 
SS 

.SS.WH1.WH1 
p450 ( SS,p450 
.SS.TfotftmJ, 
SS 

.SS.SRCRUteroglobin 
.SS.TM 

SS,RGS.RGS,RGS 
TM 

,SS.Upase.PLAT,Sec7.PH. 

CKS.SS.CKS. 

SS 

SS 

SS,lg,MHCJ,connexin.SCAN t SS,TM 
SS.TM.SS 



aquaporln7 
KIAA1204 protein 
ESTs 

similar to ratHREV107 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C5000355:gl|4503225|reflNP_000765.1| cyt 
ESTs, Weakly similar to (deffine not ava 
ESTs 

mammaglobin 2 (MGB2; mammaglobin B; lip 
ESTs. Weakly similar to I38022 hypothec 
regulator of G-protein signalling 16 
gb:QV1-HT0413-01020(M)594K)3 HT0413 Homo 
lipoprotein lipase 
COC28 protein kinase 2 
chromosome 1 open reading frame 21 
cysteine and gtycine-nch protein 2 
alpha-2-g!ycoprotein 1, zinc 
kineclln 1 (klnesin receptor) 



4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
3.9 
3.9 
3.9 
3.9 
3.9 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
•3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



454071 
451859 
420281 
427691 
428824 
424676 
418026 
457465 
417601 
407999 
425548 
446619 
411213 
406625 
417511 
428769 
407137 
401866 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 
453968 
407104 
449051 
434398 
454042 
459496 
414998 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 
406925 
445873 
418054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
418334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



AI041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs.191533 
AW194426 Hs.20726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.83213 
AW301344 Hs.122908 
NM.014735HS.82292 
AI126271 Hs.49433 
AA890023 Hs.1906 
AU076643 Hs.313 
AA676939 Hs.69285 
Y13647 Hs.1 19597 
AL049176 Hs.82223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA502490 Hs.336695 
NM_000909Hs.169266 
AW0897O5 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM-.014400Hs.H950 
AI791493 Hs.129873 
A1308876 Hs.103849 
AF245505 Hs.72157 
AI417828 Hs.192435 
AA847843 Hs.62711 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA808940 Hs.274450 
NM_002543Hs.77729 
AA116021 Hs.38260 
AI651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NM.014918HS.110488 
AK001423 Hs.94694 
U25128 Hs.159499 
AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NM_002318Hs.83354 
BE390551 Hs.77628 
NM_003512Hs.28777 
NM000346HS.2316 
AA442324 Hs.795 
M25609 Hs.64173 

NMJM45B5HS.17466 
AA319233 Hs.5521 
C18356 Hs.295944 
AF182277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NM.005419HS.72988 
AW968504 Hs.123073 

BE274552 Hs.76578 
AW797437 Hs.69771 
H73505 Hs.117874 
D86961 Hs.79299 
NMJJ06456HS.288215 



,TM,7tmJ. 

l SS,TM,EGF l WLrecepta,ldLrecepLb,EGF 
,SS,AAA, 

,SS,GlycosJransL2, 
SS 

GlycosJransf_2 t Ridn_BJectin,SS,Glycos 
lipocalin.SS.Iipocarm.Iipocalin.ferrili 
.SS.Pribosyltran.Sulfatase 
PHD,pklnase,SS 
SS 

$S,TM,fn3,SS,TM,fn3, 
,SS,TM,e(hand.ion trans 

SSJM,CUB l F5_F8_type^C.MAM,SSJM,CUB l F5ji6uropilin1 



ESTs 

ESTs, Weakly similar to ALU1.HUMAN ALU S 
ESTs 
ESTs 
ESTs 

UDP-N-acetyl-alpha-D-galadosarnine:polyp 
fatty add binding protein 4, adipocyte 
DNA replication factor 
KIAA0215 gene product 

ESTs, Weakly similar to YZ28_HUMAN HYPOT 
prolactin receptor 

secreted phosphoprotein 1 (osteopontin, 



stearoyi-CoA desaturase (delta-9-desatur 
chordin-fike 
ESTs 

gb:ye53h05.s1 Soares fetal liver spleen 
Target Exon 

mesenchyme homeo box 1 
ESTs 

neuropeptide Y receptor Y1 
ESTs. Weakly similar to S64329 probable 
ESTs 

small inducible cytokine subfamily A (Cy 
GPI-anchored metastasis-associated prote 
ESTs, Weakly similar to A36036 cytochrom 
,T^hemopaxin,Peptidase_M10,riemopexinpehypothelical protein DKFZp761D1 12 
ig.LRRCT. OKFZP564I1922 protein 

,SS,TM ESTs 

,SS,HMGJ>ox, Homo sapiens, clone IMAGE:3351295. mRNA 

,SS,TM,S^,PH,pkinase.Reoep - L_domain^urv-erW)2 avian erythroblastic leukemia v 



SS.TM,Oesaturase,SS 
SS 

.SS^tmJ.SPRY, 

,SS,TM,GDALCD39 

.SS.fiiament, 

SS.homeobox,Ets,SS,homeobox, 
SS 

SS,TM,7tmJ, 
SS 

,SS,TM,PMP22_Claudin l 
,SS,TM,IL8 
,SS,PH,lactamase_B 
,SS,p450,p450 



HER2 receptor tyrosine kinase (c-erb-b2, 
serum-inducible kinase 
hypothetical protein FU20093 
EST 

oxidised low density lipoprotein (lectin 
ubiquitin specific protease 18 
ESTs 
cycfin B1 

ESTs, Weakly similar to ALUS JHUMAN ALU S 
WAAQ990 protein 

Homo sapiens cONA FU10561 fis, done NT 
parathyroid hormone receptor 2 



SRY (sex determining region Y)-box 9 (ca 
H2A histone family, member 0 
ATPase, H transporting, lysosomal (vacuo 
NlvL005557«:Homo sapiens keratin 16 (foca 
retinoic add receptor responder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytochrome P450, subfamily IIB (phenobar 
ESTs, Weakly similar to S65824 reverse t 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans 
COC2-related protein kinase 7 
NM_003371*:Homo sapiens vav 2 oncogene ( 
protein Inhibitor of activated STAT3 
B-factor, properdin 
ESTs 

lipoma HMG1C fusion partner-like 2 
sialyltransferase 



SS 

pkinase,POLO_box,SS ) pkinase,POLOJ)ox, 
,SS,(g,pkinase,LRRNT,LRRCT, 
,SS,TM,KRA8,SCAN,2f-C2H2 1 ig 
,SS,TM 

SS.UCH-1 ,UCH-2,SS,TM,G jlu Jranspept 
SS 

cydin.SS.TM.cyclin. 
SS 
SS 
SS 

SS,TM,7tmJ2,SS,TM,7t/TL2 

JM,ABC_membrane,ABCJran,RibosomaLS4eCbp/^ transactivator, wit 

,SS 1 TM,transpcrtprot,SWlB,RhoGAP,DAG_PEglycerol-3-phosphate dehydrogenase 1 (so 
.SS.rrm.PABP.pkinase.M-S-S.rrm poly{A)-binding protein, cytoplasmic 1 -I 

,SS,TM,mito_carr,LysyLoxidase lysyloxidase-like 2 

SS,START.SS,START,NNMT PNMT.TEMT, steroidogenic acute regulatory protein r 
SS,histone,Calc„CGRP^IAPP,(g,MHCJ ( SPRY,H2A histone family, member L 
SS,HMG_box, 
histone.SS.histone.BolA 
ATP-synLab,SS,7tm_1 ,ATP-synLab 
filament.SS.filament, filament 
TM 

,SS,TM,RibosomaLL27e, 
.Kunltz^BPTI, 
SS,p450,SS 
SS 

SS.IL8, 

SH2,STAT,SS,STAT 
.pkinase, 

RhoGEF,PH.SS,RhoGEF,PH, 
SAP,SS,FG-GAP,vwa 
SS^ushi.trypsin.vwa.rrm.fibrinogen^C.fn 
,SS,Peptidase_S8,P,Peptidase_S8,P 
,SSJM 

.SS.Pribosyltran, 
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445462 AA378776 Hs.288649 
439452 AA918317 Hs.57987 
452017 AF109302 Hs.27495 
409099 AK000725 Hs.50579 
5 452106 A1141031 Hs.21342 
447519 U46258 Hs.339665 
426928 AF037062 Hs.172914 
438825 BE327427 Hs.79953 
414575 H11257 Hs.22968 

10 417837 AL079905 Hs.1103 
422128 AW881145 
445941 A1267371 Hs.172636 
429973 AI423317 Hs.164680 
444542 AI161293 Hs.280380 

15 459561 AI547306 Hs.134981 
425741 AF052152 Hs.159412 
426501 AW043782 Hs.293616 
456508 AA502764 Hs.123469 
434228 Z42047 Hs.283978 

20 415752 BE314524 Hs.78776 
400419 AF084545 
439750 AL359053 Hs.57664 - 
423858 AL137326 Hs.133483 
428514 AW236861 Hs.193139 

25 428698 AA852773 Hs.334338 
448988 Y09763 Hs.22785 
432072 N62937 Hs.269109 
417433 BE270266 Hs.82128 
452194 AI694413 Hs.332649 

30 444051 N48373 Hs.10247 
420042 AW01514O Hs.161723 
457292 A1921270 Hs.334682 
421458 NM_003654Hs.104576 
431104 AW970859 Hs.313503 

35 443767 BE562136 Hs.9736 
419589 AW973708 Hs.201925 
415447 Z97171 Hs.78454 
443464 BE548446 Hs.5167 
423431 AA326062 

40 413278 BE563085 Hs.833 

458451 AW297181 Hs.195922 
440449 AA885430 Hs.201925 
413753 U17760 Hs.75517 
434876 AF16G477 Hs.61460 

45 435575 AF213457 Hs.44234 
415773 R21651 Hs.324725 
446440 AV658411 Hs.42656 
450847 NM_003155Hs.25590 
426075 AW513691 Hs.270149 

50 452110 T47667 Hs.28005 
439963 AW247529 Hs.6793 
402837 NA 

439451 AF086270 Hs.278554 
406664 L34041 Hs.9739 

55 417315 AI080042 Hs.336901 
413011 AW068115 Hs.821 
414987 AA524394 Hs.294022 
429197 H24471 Hs.26930 

^ ' 448030 N30714 Hs.325960 

60 407604 AW191962 Hs.249239 
419092 J05581 Hs.89603 
456672 AK002016 Hs.114727 
443171 BE281128 Hs.9030 
452256 AK000933 Hs.28661 

65 432201 AI538613 Hs.298241 
406642 AJ245210 
400903 NA 



ss.ss 

ss.ss 

ss 

ss 

ss 

ss 

,SS,adh_shortJGF-beta,TGFb_propeptide 
,SS,TM,histone,ANFjeceptor,guanyiate_cy 
,SS,pkinase,ig, 

SS,TGFb_4?ropep8de,T6F-bela 1 SS 
SS 

SS,SS,fipoxygenase,PlAT 
,SS,T-box,UDPGT 

SS.SS.Peplidase_M1,EGF,lg,Iectin_c ( siishi 
SS 

.pkinase, 

SS 

SS 

,SS.TM,7tmJ 
TM 

,SS,Pep!]dase_M1, 

JM,integria_B,Ricln_BJectln,rnn 

.SS.TM 

,SS,START,NNMT_PNMT_TEMT, 
SS 

SSJM.TM 
.Sema.tg, 

SS.TM.LRRCT.LRRNT.LRRJM.LRRCT, 

.SSJM,7tmJ3,ANF.jeceptor,sushi 

,SS,lg t 

.SS.CUB, 

SS.TM.SS.TM.G-patch 
SS 

,Sema,ig ( 

,SS,PCI,RasGEF,hormone_rec,zf-C4, 
,FGF, 

SS.OLF.OLF.OLF.RIbosomal L4 

SS.TM.SSF.SS.TM 

,SS,p450,p450 

.SS.TM.ubiquitjn.lamlniiLG.lamlnin.EGF.k 
,SS,RibosomaLL14 
,FGF, 



hypothetical protein MGC3Q77 

B-ceD Ctl/lymphoma 1 1B (zinc finger pro 

prostate cancer associated protein 7 

hypothefical protein FU20718 

ESTs 

ESTs 

retinol dehydrogenase 5 (1 1-ds and 9-d 
ESTs 

Homo sapiens done IMAGE:451939, mRNA se 
transforming growth factor, beta 1 
gb:QV0-OT0033-010400-182-a07 OT0033 Homo 
ESTs 
ESTs 



ESTs 

Homo sapiens done 24628 mRNA sequence 
ESTs 

ESTs. Weakly similar to AF208855 1 BM-01 
Homo sapiens PR02751 mRNA, complete cds 
putative transmembrane protein 
Target 

Homo sapiens mRNA full length insert cDN 
Homo sapiens mRNA; cDNA DKFZp434B0650 (f 
ESTs 

K1AA1866 protein 

gamma-aminobutyrlc add (GABA) A recepto 
ESTs 

5T4 oncofetal trophoblast glycoprotein 
olfactory receptor, family 2, subfamily 
activated leucocyte cell adhesion molecu 
ESTs 

hypothetical protein FU 14251 
carbohydrate (keratan sulfate Gal-6) sal 
ESTs 

proteasome (prosome, macropain) 26S subu 
Homo sapiens cONA FU13446 fis. done PI 
myocflm, trabecular meshwork inducible 
Homo sapiens mRNA; cDNA DKFZp434F152 (fr 
gb:EST29171 Cerebellum II Homo sapiens c 
interferon-stimulated protein, 15 kOa 
ESTs 

Homo sapfens cONA FLJ13446 fis, done PL 



SSJaminin_EGF,lamlnin_Nterm^dh^shortS laminin, beta 3 (nfcein (125kD), kalinln 



,SS,HLH 
SS.lg.SSJM 

.SS,TM l Ri»somaLS3Ae,G-gamma 
SS 

.SS.homeobox, 
,SS,fn3, 

,SS,TM,ActiviruBcp 1 pklnase 

,TM,p45Q,Ets 

SS 

,SS,Chramo__shadow,chromo, 



Ig superfamily receptor LNIR 
triggering receptor expressed on myeloid 
ESTs, Moderately similar to A47582 B-cel 
KIAA1681 protein 
stanniocaldn 1 

ESTs, Weakly similar to 2109260A B cell 
Homo sapiens cDNA FU11309 fis, clone PL 



ENSP00000241312*:DJ947L8.1.8 (novel Sush 
heterochromatin-like protein 1 



l SS,TM,trarisportj)rotSWIB,Rr»GAP 1 DAG_PEg^ 1 (so 

,SS,RNAj)oLA.RNAj>oLA2.RibosomaLS24e 1 rito protein S24 



SS,LRR,LRRNT,SS,LRRNT,LRR, 
.SS,connexin,ho/mone„recif-C4,connexin 
.SS.Gelsolin. 
.SS.TM 
,SS,TM,C1q, 
SS.TM.SEA, 
.SS.PK.PK 
SS.TM,7tmJ l rrm.SS 
,TM,GDI,7tmJ, 

SSJM.trypsin,SS,TM,trefoll,trypsln ( tref 

SS 

SS 

311 



biglycan 
hypothetical protein FU14950 
ESTs, WeaWy similar to T20272 hypotheti 
membrane-spanning 4-domains, subfamily A 
collagen, type VIII. alpha 2 
mucin 1, transmembrane 
Homo sapiens, done MGC:16327, mRNA, com 
TONDU 

Homo sapiens cDNA FU10071 fis, done HE 
Transmembrane protease, serine 3 
gb:Homo sapiens mRNA for immunoglobulin 
Target Exon 
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15 
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35 



40 



434408 
452994 
445903 
424364 
410196 
419150 
433417 
418624 
436291 
411000 
412519 
450223 
422790 
424269 
435854 
447388 
451631 
448105 
438637 
423024 
456592 
425920 
444670 
401197 
437755 
452560 
410274 
450098 
404826 
458389 
408196 
433675 
418848 
422095 
415992 
424631 
409956 
419667 
450946 
447770 
423826 
418838 
442804 
432234 



AI031771 

AW962597 

A1347487 

AW383226 

AI936442 

T29618 

AA587773 

AI734080 

BE568452 

N40449 

AA196241 

AA418204 

AA809875 

AW137691 

AJ278120 

AW630534 

R00866 

AW591433 

BE500941 

AA593731 

R91600 

AL049977 

H58373 



Hs.132586 

Hs.31305 

HS.132781 

Hs.201189 

Hs.59838 

Hs.89640 

Hs,8859 

Hs.104211 

Hs.5101 

Hs.201619 

Hs.73980 

Hs.241493 

Hs.25933 

Hs.199754 

Hs.4996 

Hs.76277 

Hs.298241 
Hs.126730 
Hs325823 

Hs.162209 
Hs.332938 



AW204256 Hs.291887 
BE077084 Hs.336432 
AA381807 Hs.61762 
W27249 Hs.8109 



H70284 

AL034548 

AW977653 

AI820961 

AI868872 

C05837 

AA688021 

AW103364 

AU077005 

AA374569 

AB032417 

U20325 

AW385224 

AW300118 

AA532807 



Hs.160152 

Hs.43627 

Hs.75319 

Hs.193465 

Hs.282804 

Hs.145807 

Hs.179808 

Hs727 

Hs.92208 

Hs.127698 

Hs.19545 

Hs.1707 

Hs.35198 

Hs.131257 

Hs.105822 



,SS,Giyco_hydro_2 ESTs 
SS,WD40,SS,WD40, KIAA1 547 protein 

fn3,SS,TM,EF1 BO class I cytokine receptor 

SS ESTs, Weakly similar to G01763 atrophin- 

UBACTjepeatSS.UBACT_repeat.ThiF Jamityhypothetlcal protein FLJ10808 
,TM,pkinase,fn3, TEK tyrosine kinase, endothelial (venous 

.SS.SRCR, Homo sapiens, Similar to RIKEN cDNA 5830 

,Sema,ig, ESTs 
.SS.abhydrolase, protein regulator of cytokinesis 1 

SS ESTs, Weakly simflar to S38383 SEB4B pro 

SS,Troponin t Hemagglutinin,SS,TM,C2,Tropo troponin T1, skeletal, slow 



,SS,proJsomerase, 
,TM f histone,Sec1 ,histone,sugar_tr 
,SS.TM,7tm_2,GPS 
,SS,WD40 

,SS I TM,rrm,oxidoredji6 l oxidoredji6 
SS 

,SS,TM,trefoll f trypsln,trefoil 
JM.PH, 
.SS.TM.CD36 



natural killer-tumor recognition sequenc 
ESTs 
ESTs 

putative ankyrin-repeat containing prote 
Homo sapiens, clone MGC:9381, mRNA, comp 
gb:ye79c02.s1 Soaies fetal liver spleen 
Transmembrane protease, serine 3 
ESTs, Weakly similar to KIAA1214 protein 
ESTs. Moderately similar to ALU5_HUMAN A 



,SS.Ran BP1,UM,Ran_BP1,GRIPJPR,LlM gb:yq10c02.rl Soares fetal Over spleen 
SS.mSS,TM.PMP2Cciaudin,PMP22_Claudindaudin8 
,SS.TM hypothetical protein MGC5370 

arf.arf, ENSP00000229263*:HSPC213. 
,wnt ESTs 
.SS l rrm,zf-RanBP,pkinase.C2,pkinase_C,DA ESTs 
SS.SS hypoxia-inducible protein 2 

SS hypothetical protein FLJ21080 

.SS.TM Target Exon 

.SS.RA ESTs, Weakly similar to FPHU alpha-fetop 

HMG_box,pkinase^-CCHC,SS,TM 1 HMG_box.SRY (sex determining region Y>iox 22 
,SS,ribonucjed_sm, ribonucleotide reductase M2 polypeptide 

,deam.ZU5,pkirase f Actwin_recp 1 ESTs 
SS,Cu-oxkJase.SS,Cu-oxidase ( Cu<oxidase hypothetical protein FLJ22704 
.SS.TM hypothetical protein FU13593 

SS ESTs 

SS.TGF-beta.TGFbjjropeptide.SS.TGF-beta.inhibin. beta A (acth/in A, activin AB a 
SS,disintegrin,Reprolysin,Pep_.M12B jirope a disintegrin and metalloproteinase doma 
SS ESTs. Moderately similar to 2109260A Be 

Frizzled,Fz.SS,TM t Frizzled,F2. frizzled (Drosophila) homolog 4 

SS.SS cocaine- and amphetamine-regulated trans 

.SS.TM.Phosphodiest ectonucleotide pyrophosphatase/phosphodi 

.SS.TM.G-gamma ESTs 
,SSJM,pkinase, ESTs 
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TABLE 20A 
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10 
15 
20 
25 
30 



Table 20A shows the accession numbers for those pkeys lacking unigenelD's for Table 20. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene duster number 



Accession: 


Genbank £ 


tccession numbers 


Pkey 


CAT number 


Accessions 


410785 


1221055J 


AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 


411667 


1253334,1 


BE160198 AW935898 T11520 AW935930 AW856073 AW861034 


418636 


177402J 


AW749855 AA225995 AW750208 AW750206 


420854 


197072J 


AW296927 AI684514 AI263168 AA281079 


422128 


211994J 


AW881 145 AA490718 M85637 AA304575 T06067 AA331991 


423431 


228162J 


AA326062 AA325758 AW962182 


423945 


233566 1 


AA410943 AW948953 AA334202 AA332882 


426878 


273265 1 


BE069341 AW748403 AL044891 Al 908240 AA393080 


433687 


373061J 


M743991 M604852 AW272737 


447197 


711623J 


R36075A1366546R36167 


451631 


878098.1 


R00866 R01523AI806815 


456207 


165078_-1 


AA1 93450 


456592 


202684J 


R91600T87079AA291455 
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TABLE 20B 

Table 20B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



30 



35 



40 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled The DMA sequence of human chromosome 22." Dunham L et aL, Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand NLposition 


400608 


9887666 


Minus 


96756-97558 


400903 


2911732 


Plus 


59112-59228 


401045 


8117619 


Plus 


90044-90184,91111-91345 


401093 


8516137 


Minus 


22335-23166 


401197 


9719705 


Plus 


176341-176452 


401747 


9789672 


Minus 


11859oM188l6,119119-119244,119609.119761J20422-120990^ 








131932,132451-132575.133580-134011 


401780 


7249190 


Minus 


28397-28617^8920-29045,29135-29296,29411.29567,29705-29787,30224-30573 


401781 


7249190 


Minus 


83215-83435,83531-83656,83740-83901,84237^393,84955^5037,86290^6814 


401866 


8018106 


Plus 


73126-73623 


402230 


9986312 


Minus 


29782-29932 


402408 


9796239 


Minus 


110326-110491 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


402837 


9369121 


Minus 


201^2186,9570-9758,11136-11309.19429-19677,2121^ 


403199 


9958183 


Minus 


58895-59036,66618-66789 


403329 


8516120 


Plus 


96450-96598 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403943 


7711864 


Plus 


100742-100904,101322-101503 


404091 


7684554 


Minus 


82121-83229 


404347 


9838195 


Plus 


74493-74829 


404826 


6572184 


Plus 


47726-48046 


405366 


2182280 


Plus 


22478-22632 


405654 


4895155 


Minus 


53624-53759 
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TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 
AMENABLE TO MODULATION BY SMALL MOLECULES 

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 
molecules. These were selected as for Table 19, except that the ratio was greater than or equal 
to 3.0, the "average" normal adult tissue level was set to the 85 th percentile value amongst 
144 non-malignant tissues, and the 96 th percentile value amongst 73 breast cancers was 
greater than or equal 80 units, and the predicted protein contained a structural domain that is 
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, 
peptidase, phosphatase, ATPase, or ionjxansporter domains). The predicted protein domains 
are noted. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of 93rd percentile tumor to BS 0 * percentile of normal body fissue 



Pkey ExAccn UnigenelD Predicted Protein Domains UnigeneTitle R1 

449746 AJ668594 Hs.176588 SS,p450 ESTs, Weakly similar to CP4Y HUMAN CnOC 657 

400292 AA250737 Hs.72472 death,ZU5 ( TM^ctivin_recp,pkinase, BMP-R1B 55 9 

424735 U31875 Hs.272499 SS.TM short-chain alcohol dehydrogenase family 53.8 

407178 AA195651 Hs.104106 SS.Dihydroorotase, ESTs 393 

408045 AW138959 Hs.245123 PhosphodiestSomatomedin B, ESTs 349 

450375 AA009647 Hs.8850 SSJM.oTsintegrinPep.M12Bj)ropep t Reprol a dlsintegrin and metalloproteinase dorna 257 

429170 NM_001394Hs.2359 DSPcRhodanese, dual specificity phosphatase 4 24 9 

445730 A1624342 Hs,170042 SS,TM,Cation_efilux ESTs 241 

424634 NM_003613Hs.151407 ig,tsp_1.SS.AAA cartilage intennediate layer protein, nu 21.7 

420757 X78592 Hs.99915 hormone_recAndrogen_recep,2f-C4 ( androgen receptor (dihydrotestosterone r 21.7 

424399 AJ905687 Hs.2533 SS aldehyde dehydrogenase 9 family, member 20.3 

447350 AI375572 Hs.172634 pkinase. ESTs 192 

456207 AA193450 SS,TM,p450,p450 gbzr40e07.r1 SoaresJ4hHMPu_S1 Homo sapi 18.3 

456938 X52509 Hs.161640 SSJM,arrihrotranJ_2,C^erin_Cjenura aminotransferase 18.1 

402578 SS l p450,SSJM l p450 C1001134:gi|2117372|pir||l65981 fatty ac 17.8 

425692 D90041 Hs.155956 SS,Acetyftransf2. N-acetyltransferase 1 (arylamine N-acety 16.7 

424001 W678Q3 Hs.137476 pkinase, paternally expressed 10 16.5 

418007 M13509 Hs.83169 SS,hemopexin,Pepfidase_M10,SS.Peptidase_matnx metaiioproteinase 1 {MMP1; Inters 15.7 

421727 Y13153 Hs.107318 Jcynurenine 3-monooxygenase (kynurenine 3 15.3 

411869 W20027 Hs.23439 SS,Peptidase_M1, ESTs 13.9 

400289 X07820 Hs.2258 henropexinPeptkiasejVllO.SS.Pep^ 13.5 

443348 AW873596 Hs.182278 SS.DENN calmodurm 2 (phosphorylase kinase, delt 130 

424086 AI351010 Hs.102267 SS,LysyLoxidase lysyl oxidase 12.8 

400295 W72838 Hs.2533 SS aldehyde dehydrogenase 9 family, member 1Z7 

408771 AW732573 Hs.47584 TM,l^tetra l ion_trans,SS,TM,ICtetra,ion t potassium voltage-gated channel, delayed 11.9 

421155 H87879 Hs.102267 SS,Lysyl_oxidaseAfdose_epim,Epimerase,S fysyl oxidase 11.8 

424905 NMJ)02497Hs.153704 pkinase,SS f TM,pldnase,polyprenyLsynt, NIMA (never in mitosis gene a)~reiated k 117 

438167 R28363 Hs.24286 SS,TM.7tm_1 l p450 l rrm ESTs 11.5 

459583 AI907673 pkinase, gb:iL-BT1 52-080399-004 BT152 Homo sapien 11.5 

423945 AA410943 deam,ZU5.TMActivin_recp,pkinase l gb:zt32h03.ii Soares ovary tumor NbHOT H 11.4 

445263 H57646 Hs.42586 SS^cyltransferase, KIAA156G protein m 
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45 



50 



55 



60 



65 



423887 
429859 
428368 
418912 
451952 
402408 
415539 
406687 
419948 
400285 
408380 
409203 
424902 
431725 
418092 
427811 
423554 
426261 
413374 
432677 
456986 
453331 
407721 
418004 
410555 
443695 
423545 
401045 

442082 
418836 
404091 
450865 
424085 
449650 
432304 
415669 
442942 
428795 
428479 
400300 
447359 
402230 
424687 
432328 
432239 
400286 
425247 
434737 
439310 
441111 
452355 
427711 
429353 
425325 

448706 
403593 
432777 
446232 
411165 
414117 
411096 
450506 
417975 
447752 
427122 



AL080207 Hs.134585 
NM.007050HS.225952 
BE440042 Hs.83326 
NMJ00685HS.89472 
AL120173 Hs.301663 
NA 

AJ733881 Hs.72472 
M31126 Hs.272620 
AB041035 Hs.93847 
NA 

AF123050 Hs.44532 
AA780473 Hs.687 
NMJtt)3866Hs.153687 
X65724 Hs.2839 
R45154 Hs.106604 
M81057 Hs.180884 
M90516 H3.1674 
AW242243 Hs.168670 
NM_001034Hs75319 
NM_004482Hs.278611 
038299 Hs.170917 



SS,TM,BRCT,anKABC_tran,ABC_tran DKFZP434G232 protein 
SS.TM.Y j)hosphatase,MAM,fn3, protein tyrosine phosphatase, receptor t 

SS,PepUdase_M10,hemopexin,SSPeptidase_rnatrixmetalIoproteinas 3 (stromelystn 
SSJM,7tm_1 ,SS,TM,7tmJ. angiotensin receptor 1 

SS.pWnase, ESTs 

SS,carb_anhydrase NNU>30920*:Homo sapiens hypothetical pro 

death.ZUSJM^ctivirurecp^kinase, BMP-R1B 
SS,PeptidaseJrf10,herrK)pexm,SS.Peptidase^ 11 (MMP11; stro 



FenicjedurjUM.Ferrtcjeduct, 
TMABC_tran f ABC_membrane, 
SSJHubiquitiny^^NFjeceptor^ushi 
SS.p450.SS.p450 
SS.SS 

SS,Cys_knot,SS 
deathZU5,pkinase,Activinjecp, 



NM_016931:Homo sapiens NADPH oxidase 4 ( 
Eos Control 
diubiquffln 

cytochrome P450, subfamily IVB, polypept 
inositol por/phosprate-4-phosphatase. ty 
Norrie disease (pseudogliorna) 
ESTs 



AI240665 
Y12735 
U37519 
U92649 



Hs.8895 
Hs.38018 
Hs.87539 
Hs.64311 



AW204099 Hs.337720 
AP000692 Hs.129781 



C1 1 001 883*:gi|6753278|reqNP_033938.1 J c 



SS^rccarbOpeptPropep^M14,SS.ProrjepJ^14(arboxypeptidase B1 (tissue) 
GATase_2,SIS, glutamlne-fructose-6-phosphate transamln 

SS JM,WD40,ubiqultin,E1^^TPase l CationpefO)dsoma! famesylated protein 
SS ribonucleotide reductase M2 polypeptide 

SS,TM,Glycos_transL2,RiciruBJectin,DPP UDP^cetyl-alpta : D-galactosamine'.polyp 
SS,TM.7tm_1. prostaglandin E receptor 3 (subtype EP3) 

SS,TM,disintegrin,Pep_M1 2B4)ropep.Reprol ESTs 

pkinase. dual-specificiry tyrosine-(Y)-phosphoryl 

SS,aldedrt,SS,aJdedh, aldehyde dehydrogenase 3 family, member 

TM,disin!egrin,Reprolysin, a dislntegrin and metalloproteinase doma 

ESTs, Weakly similar to AF126780 1 retin 
GAF,PDEase chromosome 21 open reading frame 5 

rCEj)20.SS I ICE_p10.ICEj)2Q J ICEj)10,ICEj) 



Hs7413 
Hs.161712 



R41823 
AI655499 
NA 

AI248013 Hs.106532 
NMJ)02914Hs.139226 
AF055575 Hs.23838 
AA932186 Hs.69297 
NMJ)05025Hs.78589 
AW167087 Hs.131562 
R45503 Hs.97469 
Y00272 Hs.184572 
X03363 

NMJ)12G93Hs.18268 
NA 

J05070 Hs.151738 
AI572739 Hs.195471 
X81334 Hs.2936 
NA 

NMJ)05940Hs.155324 
AA828246 Hs.291884 
AF086120 Hs.102793 
AI806867 Hs.126594 
N54926 Hs.29202 
M31659 Hs.180408 
AL117406 Hs.200102 



TM.EPH_Ibd.pkinase.SAM.fn3. 
TM,Activin_recp J pkinase,death,ZU5, 
TM,7trn_3,ANF_receptor, 
zf-C2H2 

SSMA,ViraLhelicase1 ,rrm, 

TM,ionJrans,SSJM,ionJrans, 

TMJtmJ, 

SS^erpin, 

SS,ig,Sema.pklnase, 
SSJM 

SS.pkinase.pkinase 



67 

ESTs; calsyntenin-2 
ESTs 

Target Exon 

ESTs, Weakly similar to I38588 reverse t 
replication factor C (activator 1) 2 (40 
calcium channel, voltage-dependent, L ty 
ESTs 

serine (or cysteine) proteinase inhibito 
ESTs 

ESTs. Highly similar to A39769 N-acetyll 
cell division cycle 2, G1 to S and G2 to 



X52730 Hs.1892 
phenyiethanofamlne N-methyttransferase 
AW291095 Hs.21814 SS.TM.pldnase, 
NA , 

AA564991 Hs.269477 
A1281848 Hs.194691 
NMJ)00169Hs.69089 
W88559 Hs.1787 
U80034 Hs.68583 
NMJ)04460Hs.418 
AA641836 Hs.30085 
M73700 Hs.105938 
AW057736 Hs^23910 



SS,TM,pkinase.Recep_L_domaIn,SH2.PH.FuriHER2 receptor tyrosine kinase (c-erb-b2, 
SS.adenytatekinase, adenylate kinase 5 

SS.TM.p450, Target Exon 

SS.Peptidase_M10.fn2,hemopexin,SS.TM.Pepmatrix metaDoproleinase 9 (gelatinase B 
6PF2K.PGAM, 6^hosphofmcto-2-kinase/mJctose-2,6-bi 
SS.Peptidase M10,hemopexin,SS,Peptidase - matrix metanoprotelnase 13 (collagenase 
SSJM.ABCjran.ABC.membrane.SS C16000922:gl|7499103|pirj|T20903hypothe 
SS,Pepridase_M10,hemopexin,SS matrix metaUoprotefnase 1 1 (MMP11; stro 

UCH-1,pkinase,OPR,Rhodanese^MP-bindlrig.ESTs 
SS,TM,UDPGT,caseinJappa ESTs 
SS.TM^Phosphodiest, ESTs 
TM,7tm J ,TM G protein-coupted receptor 34 

SS solute carrier family 25 (mitochondrial 

SS JM.ABC Jran,ABCjrembrane, ATP-binding cassette transporter MRP8 

SS,NNMT_PNMT TEMT,SS,NNMTJWJEMT,STAR 

5.1 

interleukin 20 receptor, alpha 
CIDE-N,pkinase Target Exon 

alpha-amytase, ESTs 
SS,TM,7tnru3,RibosomaL-13 retinolc add induced 3 

SS,Melibiase,BTK,PH l pldnase,SH2,SH3,Ribo galactosidase, alpha 
TM,ionJrans,K_tetf3. proteolipW protein 1 (Pellzaeus-Merzbac 

Peptidase J43, mitochondrial Intermediate peptidase 

SS.DPPlV.N term.Peptidase S9.SS,DPPJV_NJbrob)ast activation protein, alpha 
SS.trypsin hypothetical protein FU23186 

SS,transferrin,7tm J .transferrin, lactotransferrin 
SS,TM l pWnase^ecep_LJomain,SH2.PH l FuriHER2 receptor tyrosine kinase (c-erb-b2, 



10.9 

10.4 

10.3 

10.3 

10.3 

9.8 

9.4 

9.1 

9.1 

8.8 

8.6 

8.3 

8.3 

8.3 

8.3 

8.2 

8.1 

7.8 

7.6 

7.6 

7,5 

7.3 

7.2 

7.2 

7.1 

6.9 

6.8 



6.7 
6.6 
6.6 
6.5 
6.5 
6.4 
6.2 
6.2 
6.2 
6.1 
6.1 
6.1 
6.1 
6.1 
5.8 
5.8 
5.6 
5.6 
5.4 
5.4 
5.2 
5.2 
52 
5.2 
5.1 



5.1 
5.1 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
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400181 NA SS,TM,3Beta_HSD, ENSPO0Q001 71 555:CDNA FU 10727 fis, clone 4.6 

452093 AA447453 Hs.27860 SS.TM.7tmJ, Homo sapiens mRNA; cOMA OKFZp586M0723 (f 4.6 

435542 AA687376 Hs.269533 SS.pkinase.RhoGEF.ig.PH.SH3, ESTs 4.6 

417576 AA339449 Hs.82285 AIRS.formylJransf.GARS.SS.GARS 4.6 

5 446089 A1860021 Hs.270651 pkinase ESTs, Moderately similar to A47582 B-cel 4.6 

424420 8E614743 Hs.146688 SS.TM.MAPEG, prostaglandin E synthase 4.5 

452190 H26735 Hs.91668 SS.TM.PH^^urin-Bke.pkinase.Recep^ Homo sapiens clone PP1498 unknown mRNA 4.5 

419986 Ai345455 Hs.78915 pkinase.OPR, GA-bfoding protein transcription factor, 45 

421582 AI910275 Hs.1406 SS,trefoii.SSJM.IdLrecepta,SRCR,tryps trefoil factor 1 (pS2) " 4.5 

10 446733 AA363360 Hs.26040 SS.TM.p450. ESTs, Weakly similar to fatty acid omega 4.4 

453060 AW294092 Hs.21594 SS,ras,Y_phosphatase,ras hypothetical protein MGC15754 4.4 

400205 NA NM>006265*:Homo sapiens RAD21 (S. pombe) 4.4 

420854 AW296927 SS.TM.Peptidase.M1, gb:UI-H-BW0-aj*c-07-O-Ul.s1 NCI CGAP Su 4.3 

432690 AF181490 Hs.278627 SS,pyr_redox.SS.Ribosomal_L39 prenylcysteine lyase 4.3 

15 441560 F13386 Hs.7888 pkinase, Homo sapiens clone 23736 mRNA sequence 4.3 

416445 AL043004 Hs.79337 SS.pkinase, K1AA01 35 protein 4.3 

439024 R96696 Hs.35598 SS.TM.trypstn,vwd.ig ESTs 4.3 

432882 NM.013257Hs.279696 pkinase.pkinase.C, senjm/glucocortfcold regulated kinase-Ji 4.2 

447754 AW073310 Hs.163533 pkinase. Homo sapiens cDNA FLJ 14142 fis, clone MA 42 

20 453775 NMJ02916HS.35120 SS.AAA.Pl3_Pl4.kinasePI3Ka ( Pl3lCrbd,Pl3 replication factor C (activator 1} 4 (37 4.2 

431657 A1345227 Hs.105448 SS,TM.pkinase ESTs, Weakly similar to B34087 hypotheti 4.1 

427899 AA829286 Hs.332053 SS.SAAj)rotelns .ABCLmembraneABC tran, serum amyloid A1 4.1 

422938 NM_001809Hs.1594 SS.TM.ihiolase, centromere protein A (1 7kD) 4.1 

418478 U38945 Hs.1174 ank,ank cyclin-dependent kinase inhibitor 2A (me 4.1 

25 406922 S70284 SS,TM.Desaturase,SS gb:stearoyl-CoA desaturase [human, adipo 4.1 

439285 AL133916 Hs.172572 SS.ig.pkinase.LRRNT.LRRCT, hypothetical protein FLJ20093 4.1 

429922 Z97630 Hs.226117 SS,Wmker_histone,7trrLl H1 histone family, member 0 4.1 

420139 NM.005357HS.95351 SS,TM,p450, lipase, hormone-sensitive 4.0 

425071 NMJ)13989Hs.154424 SS.T4_debdinase.T4_deiodinase, detodinase. iodothyronine, type II 4.0 

30 424511 BE300512 Hs.193557 SS.Y_phosphatase.Band_41 ESTs, Moderately similar to ALU7_HUMAN A 4.0 

428722 IJ76456 Hs.190787 SS.TIMP, tissue inhibitor of metatloproteinase 4 3.9 

428330 L22524 Hs.2256 SS,PeptidaseJ/l10,SS,PepiidaseJM0,^ 3.9 

414831 M31158 Hs.77439 SS,cNMP_binding,Rlla,HMGJ>ox protein kinase. cAMP-dependent. regulato 3.9 

413589 AW452631 Hs.313803 SS.abhydrolase ESTs, Highly similar to AF157833 1 noncj 3.6 

3 5 418629 BE247550 Hs.86859 SS l SH2,PH ( SS.TMPH,SH2,Furin-like,pklnas growth factor receptor-bound protein 7 3.8 

413453 AA129640 Hs.128065 SS,Peptiaase_C1,gpdh ESTs 3.8 

403943 p450.SS,p450 C5000355:gi|4503225lreqNP_000765.1|cyt 3.8 

444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATIO 3.8 

408761 AA057264 Hs.233936 SS,TM,7tm_1, ESTs, Weakly similar to (defiine not ava 3.8 

40 427809 M26380 Hs.180878 SS,lipase,PLAT,Sec7,PH, lipoprotein lipase 3.7 

418203 X54942 Hs.83758 CKS.SS.CKS. CDC28 protein kinase 2 37 

454071 A1041793 Hs.42502 TM,7tmJ, ESTs 3.7 

424676 Y08565 Hs.151678 Glycos_transL2»Rlcin.B_lectin.SS.GIycos UDP^-acetyklpha-D-gaIactosamine:poryp 3.7 

457465 AW301344 Hs.122908 SS,Pribosyltran,Suifatase DNA replication factor 3.7 

45 417601 NMJ)14735Hs.82292 PHD.pkinase.SS K1AA0215 gene product 3.7 

446619 AU076643 Hs.313 SS.TM.etod.bnJrans secreted phosphoprotein 1 (osteopontin, 3.7 

406625 Y13647 Hs.1 19597 SSJM,Desaturase.SS stearoyl-CoA desaturase (delta-9-desatur 3.6 

428769 AW207175 Hs.106771 SS,7tmJ,SPRY, ESTs 3.6 

426310 NM_000909Hs.169266 SS,TM.7tm_1. neuropeptide Y receptor Y1 3.6 

50 417531 NWL.0Q3157Hs.1Q87 SS.pkinase.vwa.vwa.GlycoJransLB serine/threonine kinase 2 3.6 

444781 NMJ)14400Hs.11950 SS.PH, lactamase B GPl-anchored metasiasis^assoclated prate 3.6 

431493 AI791493 Hs.129873 SS.p450.p450 ESTs. Weakly similar to A36036 cytochrom 3.6 

428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 3.6 

414175 AI308876 Hs.103849 TM,hemopexinPeptldase_M10^emopexin,Pephypothetical protein DKFZp761D112 3.6 

55 455325 AW895719 TM,ion_trans,KJetra f gb:QV4-NN0039-290300-154-f06NN0039Homo 3.6 

429597 NMJ)03816Hs.2442 adislntegrinandmetalloproteinasedoma 36 

425320 U29344 Hsm90 AcyLtransf.adfL2inc.ketoacyl-synt 1 pr>bi fatty acid synthase 3.5 

431854 AA383550 Hs.271699 IMS.SS polymerase (DNA directed) lota 3.5 

407104 S57296 Hs.323910 SSJM,SH2.PH,pkinase < Recep_l^domaHFuriv-erW)2 avian erythroblastic leukemia v 3.5 

60 449051 AW961400 Hs.333526 SS HER2 receptor tyrosine kinase (c-erb-b2, 3 5 

434398 AA121098 Hs.3838 pkinase,POLO_box.SS.pkinase,POLO_box, serunvlnduclble kinase 35 

454042 H22570 Hs. 172572 SS,ig,pkinase,LRRNT,LRRCT, hypothetical protein FU20093 3.5 

407756 AA 116021 Hs.38260 SS,UCH-1.UCH-2,SS.TM.G^g!u_transpept ubiquiiin specific protease 18 35 

401464 AF039241 Hs.9028 Peptidase_M24, histone deacetybse 5 3 5 

65 412970 AB026436 Hs.177534 DSPcMyosinJaa, dual specificity phosphatase 10 3 5 

412049 N53437 Hs.18268 SS,adenylatekinase. adenylate kinase 5 35 

425776 U25128 Hs.159499 SS.TM.7tm_2 ) SS,TM 1 7trru2 parathyroid hormone receptor 2 3.4 
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45 



407846 AA426202 Hs.40403 


TM,ABC^membrane^BC^tfaji > RlbosomaLS4eCbp/p300-interacting transaclivator, wH 


3.4 


406925 L34041 Hs.9739 


SS,TM 1 tfansponj>ratSWlB t RhoGAP,DAGJ 5 E-g!yceroW-phosphat9 dehydrogenase 1 (so 


3.4 


445873 AA250970 Hs.251946 


SS.rrm.PABP.pkinase.l 4-3-3/rm 


poly(A)-binding protein, cytoplasmic 1-1 


3.4 


418054 NM.002318HS.83354 


SS,TM,mitojarr,LysyLoxfdase 


tysylox(dase-like2 


. 3.4 


406815 AA833930 Hs.288036 


SS.IPPT, 


tRNA isopentenylpyrophosphate transfers 


3.4 


410530 M25809 Hs.64173 


ATP-synLab,SS,7tmJ ATP«synLab 


ATPase, H transporting, lysosomal (vacuo 


3.4 


407021 U52077 


gb:Human marinerl transposase gene, comp 


3.4 


421168 AF182277 Hs.330780 


SS,p450.SS 


cytochrome P450, subfamily I1B (phenobar 


3.4 


431473 AA82S686 Hs.321176 


SS 


ESTs. Weakly similar to S65824 reverse t 


3.4 


408101 AW968504 Hs.123073 


pkinase, 


CDC2-retated protein kinase 7 


3.4 


422083 NM_001141Hs.1 11256 


lipoxygenase.PLAT. 


arachidonate 15-Bpoxygenase, second typ 


3.3 


411393 AW797437 Hs.69771 


SS l sushl,tryr^in,vwa,mn,fibnnogen_C,fn 


B-factor, properdin 


3.3 


435767 H73505 Hs.117874 


SS,Peptidase_S8,P f Peptidase_S8,P 


ESTs 


3.3 


433068 NM_006456Hs.288215 


SS,Pribosyltran, 


siar/toansferase 


3.3 


426928 AF037062 Hs.172914 


SS.adh_short,TGF-b€ta,TGFbj)ropep{ide 


retinal dehydrogenase 5 (11-cls and 9-ci 


3.3 


414575 H11257 Hs.22968 


SS,pklnase,Ig, 


Homo sapiens clone IMAGE:451939, mRNA se 


3.3 


445941 AI267371 Hs.172636 


SS,SS,tipoxygenase,PLAT 


ESTs 


3.3 


444542 AI161293 Hs.280380 


SS,SS,Pepfidase_M1 f EGF,ig l /ectifuc 1 sushi 


aminopepfidase 


3.3 


425741 AF052152 Hs.159412 


pkinase, 


Homo sapiens clone 24628 mRNA sequence 


3.3 


434228 Z42047 Hs.283978 


SS.TM ( 7tmJ 


Homo sapiens PR02751 mRNA, complete cds 


3.3 


433264 D85782 Hs.3229 




cysteine dioxygenase. type 1 


3.3 


400419 AF084545 


SS,Peptldase_M1, 


Target 


3.3 


439750 AL359053 Hs.57664 


TM.integrin^B.RiclrLBJecb'n.Tm 


Homo sapiens mRNA full length insert cDN 


3.3 


417757 R19897 Hs.106604 


death,ZU5,pWnase,Actjvin_recp l 


ESTs 


3.3 


452194 AI694413 Hs.332649 


SS,TM,7tmJ,ANF_receptor.sushi 


olfactory receptor, family 2, subfamily 


3.2 


421458 NM_003654Hs.104576 


SS 


carbohydrate (keratan sulfate GaJ-6) sul 


3.2 


443767 BE562136 Hs.9736 


SS,PCI ( RasGEF,horTnone_rec,zf-C4, 


proteasome (prosome, macropain) 26S subu 


3.2 


422648 086983 Hs.118893 


peroxidase.LRRCT, 


Melanoma associated gene 


3.2 


423431 AA326062 


SS,p450 ( p450 


gb:EST29171 Cerebellum 11 Homo sapiens c 


3.2 


451264 AI768235 


SSJrehalase 


gb:wg82g08j(1 Soares_NSF_F8_9W_OT_PA_P_S 


3.2 


452110 T47667 Hs.28005 


SS,TM,Activin recp, pkinase 


Homo sapiens cDNA FLJ11309 fc, clone PL 


3.2 


439963 AW247529 Hs.6793 


TM,p450,Ets 


platelet-activating factor acetythydrola 


3.2 


453941 U39817 Hs.36820 


SS,DEAD,HRDC,helicase_C, 


Bloom syndrome 


3.1 


406664 L34041 Hs.9739 


SS.TM.transpofl^prot.SWIB^hoGAP.DAG^PE- 


glycerol-3- 


phosphate dehydrogenase 1 (so 


3.1 




3.1 


453487 R31770 Hs.23540 


TM,7tmJ, 


ESTs 


420911 U77413 Hs.100293 




O-finked N-acetylglucosamine (GlcNAc) tr 


3.1 


443171 BE281128 Hs.9030 


SS.TM.7tm J ,rrm,SS 


TONDU 


3.1 


452256 AK000933 Hs.28661 


TM,GOi,7tmJ, 


Homo sapiens cDNA FLJ10071 fis. done HE 


3.1 


432201 AI538613 Hs.298241 


SS.TM.trypsin.SS.TM.trefoa.frypsin.tre/ 


Transmembrane protease, serine 3 


3.1 


419150 T29518 Hs.89640 


TM,pWnase,fn3. 


TEK tyrosine kinase, endothelial (venous 


3.1 


444443 AI149286 Hs.55099 


SS 


rab6 GTPase activating protein (GAP and 


3.1 


426283 NM.003937HS.169139 




kynureninase (Ucynurenlne hydrolase) 


3.1 


436291 BE568452 Hs.5101 


SS.abhydrolase, 


protein regulator of cytokinesis 1 


3.1 


450223 AA418204 Hs.241493 


SS.proJsomerase, 


natural Wller-tumor recognition sequenc 


3.1 


424269 AW137691 Hs.199754 


SS.TM.7tm_2.GPS 


ESTs 


3.1 


448105 AW591433 Hs.298241 


SS.TM.trefoil.trypsin.trefoll 


Transmembrane protease, serine 3 


3.0 


452560 BE077084 Hs.336432 


SS.rnn t 2J-RanBP,pklnase.C2 1 pkInase_C^)AGESTs 


3.0 
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TABLE 21A 

Table 21 A shows the accession numbers for those pkeys lacking unigenelD's for Table 2L 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: 


Unique Eos probeset identifier number 


CAT number Genec 


luster number 


Accession: 


Genbank accession numbers 


Pkey 


CAT number Accessions 


420854 


197072 1 


AW296927 AI684514 AI2631 68 AA281079 


423431 


228162 1 


AA326062AA325758AW962182 


423945 


233566 1 


AA410943 AW948953 AA334202 AA332882 


451264 


863988 1 


AI768235 R31400 H29082 H23107 


455325 


1279475J 


AW895719N31451 N41451 


456207 


165078 -1 


AA193450 
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TABLE 21B 



Table 21B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 21. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pfcey: Unique number corresponding to an Eos prabeset 

1 0 Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. Dunham I et ai.' refers to the publication 

entitled "The DMA sequence of human chromosome 22.' Dunham I. et al. f Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposiiion: Indicates nucleotide positions of predicted exons. 



15 



20 



Pkey Ref 

401045 8117619 

402230 9966312 

402408 9796239 

402578 9884928 

403593 6862650 

403943 7711864 

404091 7684554 



Strand NLposition 

Plus 90044-90184.91111-91345 

Minus 29782-29932 

Minus 110326-110491 

Plus 66350-66496 

Minus 62554-62712.6944W9602 

Plus 100742-100904,101322-101503 

Minus 82121-83229 
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TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85 th percentile value for 12 non-malignant breast specimens, and the 
96 th percentile value amongst the 73 breast cancers was greater than or equal 100 units. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

1 5 Unigene Title: Unigene gene title 

R1 : Ratio of 90 th percentile tumor to 85 th percentile normal breast tissue 



Pkey ExAccn UnigenelD UnigeneTitle R1 



400292 AA250737 Hs.72472 BMP-R1B 51.5 

424735 U31875 Hs.272499 shortchain alcohol dehydrogenase family 38.3 

400297 AI127076 Hs.334473 hypothetical protein DKFZp55401 278 29.9 

431448 AL137517 Hs.334473 hypothetical protein OKFZp5640 1278 26.9 

25 451110 AI955040 Hs.265393 ESTs, WeaMy similar to fransformafion-f 25.8 

431211 M86849 Hs.323733 gap junction protein, beta2,26kD (conn 232 

418203 X54942 Hs.83758 COC28 protein kinase 2 22.6 
407980 AA046309 gb:zf12fOU1 Soares felalJieartNbHH19W 19.8 

414646 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 18.9 

30 446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0 

409041 AB033025 Hs.50081 K1AA1 199 protein 17.6 

412140 AA219691 Hs,73625 RAB6 interacting, kinesln-Iike (rabkiies 17.6 

407824 AA147884 Hs.9812 Homo sapiens cDNA FU14388 fis, done HE 17.1 

453160 A1263307 Hs.239884 H2B histone family, member L 17.0 

35 407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 16.1 

425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamlne N-aceiy 16.1 

438533 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 16.0 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 15.5 

444342 NM_014398Hs.10887 slmBar to lysosome-associated membrane 15.1 

40 422505 AL120862 Hs.124165 programmed cell death 9 (PDC09) 14.9 

430515 AA746503 Hs.283313 ESTs 14.7 

417308 H60720 Hs.81892 K1AA0101 gene product 14.4 
452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr14.4 

412446 AI768015 Hs.92127 ESTs 14.2 

45 415539 A1733881 Hs.72472 BMP-R1B 14.1 

435496 AW840171 Hs.265398 ESTs, Weakly similar to transformafion-r 13.8 

438209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 13.8 
400205 NA NMJ06265*:Homo sapiens RAD21 (S. pombe)13.5 

430965 AA489732 Hs.154918 ESTs 13.4 

50 415263 AA948033 Hs.130853 ESTs 13.3 

451952 AL120173 Hs.301663 ESTs 13.2 

449722 BE280074 Hs.23960 cyclinBV 132 

406685 M18728 gb^uman nonspecific crossreacting antig 13.0 

406690 M29540 Hs.220529 carctrwembryonfc anfigen-reteted cell ad 12.8 

55 429925 NMJX)0766Hs.226213 cytochrome P450, 51 (lanosterol 14-alpha 12.8 

416498 U33632 Hs.79351 potassium channel subfamily K, member 1 127 

432378 AI493046 Hs.146133 ESTs 12.5 

441377 BE218239 Hs.202656 ESTs 125 
456207 AA193450 gb:zr40e07.r1 Soares_.NhHMPu_S1 Homo sapi12.4 

60 422805 AA436989 Hs.121017 H2A histone family, member A 12.2 

407811 AW190902 Hs.40098 cysteine knot superfamlry 1 . BMP antagon 1Z2 

407178 AA195651 Hs.104106 ESTs 1Z2 
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420931 AF044197 Hs.100431 small Inducible cytokine B subfamily (Cy 12.1 

421727 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 12.0 

434408 AI031771 Hs.132586 ESTs 12.0 

446591 H441B6 Hs.15456 PDZ domafo containing 1 11.9 

5 431385 BE178536 Hs.11090 membrane-spanning 4-domalns, subfamily A 11.8 

443348 AW873596 Hs.182278 calmodulin 2 (phosphorylase kinase, deft 11.7 

416602 NM_006159Hs.79389 nel (chicken)-«ke 2 11.7 

433365 AF026944 Hs.293797 ESTs 11.6 

437866 AA156781 Hs.74170 melanothionein 1E (functional) 11.5 

10 412472 AW975398 Hs.293836 ESTs 11.4 

416030 H15261 Hs.21948 ESTs 11.3 

439979 AW600291 Hs.6823 hypothetical protein FU 10430 113 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 11.3 

411598 BE336654 Hs.70937 H3 histone family, member A 11.2 

15 423600 AI633559 Hs.310359 ESTs 11.2 

430770 AA765694 Hs.123296 ESTs 11.0 

421037 AI684808 Hs.197653 programmed cell death 9 (POCD9) 103 

452461 N7B223 Hs.108106 transcription {actor 10.7 

409269 AA576953 Hs.22972 hypothetical protein FU 13352 10.6 

20 417791 AW965339 Hs.111471 ESTs 1Q.6 

447268 AI370413 Hs.36563 hypotheficat protein FU224 18 10.4 

424001 W67883 Hs. 137476 paternally expressed 10 10.4 

447342 A1199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 10.4 

424905 NMJ)02497Hs.153704 NIMA (never in mitosis gene a)-related k 10.1 

25 453619 H87~648 Hs.33922 Homo sapiens, clone MGC: 9084, mRNA, compl 0.1 

442942 AW167087 Hs.131562 ESTs 10.1 

434377 AW137148 Hs.306593 Homo sapiens cDNA FU1 1382 fis, clone HE 10.1 

427217 AA399272 Hs.144341 ESTs 10.1 

445730 AI624342 Hs.170042 ESTs 10.0 

30 432887 A1926047 Hs.162859 ESTs 10.0 

452243 AL355715 Hs.28555 programmed cell death 9 9.9 

424590 AW966399 Hs.46821 hypothetical protein FU20086 9.9 

432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 9.9 

438950 H23789 Hs.144530 EST 9.9 

35 418836 AI655499 Hs.161712 ESTs 9.8 

430291 AV660345 Hs.238126 CGI-49 protein 9.8 

444665 BE613126 Hs.47783 B aggressive lymphoma gene 9.7 
407377 C1 6391 gb:C1 6391 Clontech human aorta polyA mRN 9.7 

445413 AA151342 Hs.12677 CG1-147 protein 9.7 

40 443462 A1064690 Hs.171176 ESTs 9.7 

442145 A1022650 Hs.8117 erob2-lnteracfing protein ERBIN 9.7 

435570 AF212222 Hs.177812 uncharacterized bone marrow protein BM04 9.7 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cON 9.6 

428966 AF059214 Hs.194687 cholesterol 25-hydroxytase 9.6 

45 449448 D60730 Hs.57471 ESTs 9.6 

433929 AI375499 Hs.27379 ESTs 9.5 

432731 R31178 Hs.287820 fibronectinl . 9.3 

411815 AA1 56679 Hs.1 25790 leucine-rfch repeat-containing 2 9.3 

415385 R17798 Hs.7535 COBW-iike protein 9.3 

50 422026 U80736 Hs.1 10826 trinucleotide repeat containing 9 9.2 

432596 AJ224741 Hs.278461 matriEn3 9.2 

439451 AF086270 Hs.278554 heterochromatin-like protein 1 9.2 

423945 AA4 10943 gb:zt32h03.r1 Soares ovary tumor NbHOT H 9.1 

442432 BE093589 Hs.38178 hypothetical protein FLJ23468 9.1 

55 446715 AI337735 Hs. 17391 9 ESTs, Moderately similar to ZN9 INHUMAN Z 9.0 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0 

437021 AI076089 Hs.292239 ESTs 9.0 

428479 Y00272 Hs. 184572 cell division cycle 2, G1 to S and G2 to 3.9 
428839 AI767756 Hs.82302 Homo sapiens cDNA FU14814 fis, done NT 8.9 

60 402408 NA NM_030920*:Homo sapiens hypothetical pro 8.9 

418601 AA279490 Hs.86368 catmegin 8.8 
426327 W03242 Hs.44898 Homo sapiens clone TCCCTA001 51 mRNA sequ8.8 

419519 AI198719 Hs.176376 ESTs 8.8 

440621 AW296024 Hs.150434 ESTs 8.8 

65 446142 AI754693 Hs.145968 ESTs 8.8 
418196 AI745649 Hs.26549 K1AA1 708 protein 8.7 
447178 AW594641 Hs.192417 ESTs 8.7 
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427585 031 152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 8.6 
415857 AA866115 Hs.127797 Homo sapiens cDNA FU1 1381 fs, clone HE 8.6 
435061 AI651474 Hs.163944 ESTs 8.6 
431374 BE258532 Hs.251871 CTP synthase 8.4 
5 417866 AW067903 Hs.82772 collagen, type XI, alpha 1 8.4 
437211 AA382207 Hs.5509 ecotropic viral integration site 2B 8.3 
437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU 1 .HUMAN A 8.3 
423887 AL080207 Hs.134585 DKFZP434G232 protein 8.2 
440941 BE268362 Hs.7535 COBW-fike protein 8.2 

10 429859 NMJ)07050Hs.225952 protein tyrosine phosphatase, receptor t 8.2 
410193 AJ132592 Hs.59757 zinc finger protein 281 82 
431725 X65724 Hs.2839 Nome disease (pseudogBoma) 8.1 

446258 AI283476 Hs.263478 ESTs 8.1 
416747 AW876523 Hs.15929 hypothetical protein RJ 12910 8.1 

15 434424 A1811202 Hs.325335 Homo sapiens cONA: RJ23523 fis, done L 0.1 

421650 AA781795 Hs.122587 ESTs 8.0 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 8.0 

457465 AW301344 Hs.122908 DNA repfication factor 8.0 

427961 AW293165 Hs.143134 ESTs 8.0 

20 436481 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 8.0 

418216 AA662240 Hs.283099 AF15q1 4 protein 8.0 
418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 7.9 

400285 NA Eos Control 7.9 

401464 AF039241 Hs.9028 histone deacetylase 5 7.9 

25 407242 M18728 gb:Human nonspecific crossreacting antig 7.8 

422232 D43945 Hs.113274 transcription factor EC 7.8 

454024 AA993527 Hs.293907 hypothetical protein FU23403 7.8 

444542 AI161293 Hs.280380 aminopeptidase 7.8 

436396 A1683487 Hs.152213 wingless-type MMTV integration site fami 7.7 

30 437204 AL110216 Hs.12285 ESTs, WeaJdy similar to (55214 salivary 7.6 

408805 H69912 Hs.48269 vaccinia related kinase 1 7.6 

437207 T27503 Hs.15929 hypothetical protein FU12910 7.6 

442818 AK001741 Hs.8739 hypothetical protein FU10879 7.6 

426283 NMJ03937Hs.169139 kynureninase (L-kynurenine hydrolase) 7.5 

3 5 424687 J05070 Hs.151738 matrix metailoproteinase 9 (gelatinase B 7.5 

446315 NM_016293Hs.14770 bridging integrator 2 7.5 

433426 H69125 Hs.133525 ESTs 7.5 

406639 M97711 gb^uman T-cefl receptor (V beta 18.1, J 7.5 

420077 AW512260 Hs.87767 ESTs 7.4 

40 457332 AA961694 Hs.105187 Wnesm protein 9 gene 7.4 

422938 NM_001809Hs.1594 centromere protein A (17kD) 7.4 
447555 AI391662 Hs.160963 Homo sapiens, done MGC:12318, mRNA, com7.4 
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGAT107.3 

410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3 

45 400268 NA NM.003292:Homo sapiens translocated prom 7.3 

439509 AF086332 Hs.58314 ESTs 7.3 

407771 AL138272 Hs.62713 ESTs 7.3 

407202 N58172 Hs.109370 ESTs 7.3 

433096 AU076803 Hs.282975 carboxylesterase 2 (intestine, liver) 7.2 

50 422094 AF129535 Hs.272027 F-box only protein 5 ' 7.1 

430832 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 7.1 
430287 AW182459 Hs.125759 ESTs, WeaMy similar to LEU5.HUMAN LEUKE7.0 

423739 AA398155 Hs.97600 ESTs 7.0 
448212 AM75858 gb:tc87d07j(1 NCI_CGAP_CLL1 Homo sapiens7.0 

55 407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0 

454440 BE062906 Hs.28338 K1AA1546 protein 7.0 

444783 AK001468 Hs.62180 anillin (Drosophiia Scraps homolog). act 7.0 

421373 AA808229 Hs.167771 ESTs 6.9 

431960 AW241821 Hs.301927 c6.1A 6.9 

60 424704 AI263293 Hs. 1 52096 cytochrome P450, subfamily tIJ (arachido 6.8 

449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8 

439840 AW449211 Hs.105445 GDNF family receptor alpha 1 6.8 

414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8 

441243 AI767056 Hs.193002 ESTs 6.7 

65 408380 AF123050 Hs.44532 diubiquifln 6.7 

422956 BE545072 Hs.122579 hypothetical protein FU 10461 6.7 

446651 AA393907 Hs.97179 ESTs 6.7 
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419839 U24577 Hs.93304 phosphoBpase A2, group VI! (pfatetet-ac 6.7 

437740 AA810265 Hs.122915 ESTs 67 

421582 AI910275 Hs.1406 trefoil factor 1(pS2) 6.7 

427356 AW023482 Hs.97849 ESTs 6.6 

5 429597 NM_003816Hs.2442 a disintegrin and metalloprotelnase doma 6.6 

422634 NMJ)16010Hs.118821 CGI-62 protein 6.6 

421072 AI215069 Hs.89113 ESTs 6.5 

427718 A1798680 Hs.25933 ESTs 6.5 

411000 N40449 Hs.201619 ESTs, Weakly similar to S38383 SEB4B pro 6.5 

10 449343 AI151418 Hs.272458 protein phosphatase 3 (formerly 2B), cat . 6.4 

409757 NWL001898HS.123114 cystatinSN 6.4 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 6.4 

456938 X52509 Hs.1 61640 tyrosine aminotransferase 6.4 

418848 AI820981 Hs.193465 ESTs 6.4 

15 424902 NMJ)03866Hs.153687 inositol polyphosphates-phosphatase, ty 6.4 

452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4 

439452 AA918317 Hs.57987 B-cell CLUrymphoma 118 (zinc finger pro 6.4 

407266 AJ235664 gb:Homo sapiens mRNA for immunoglobulin 6.3 

411078 AI222020 Hs.182364 CocoaCrisp 6.3 

20 433001 AF217513 Hs.279905 clone HQ0310PRO0310p1 6.3 
434340 AI193043 Hs.128685 ESTs, Weakly simflar to T17226 hypothetl ,6.2 

429503 AA394183 Hs.26873 ESTs 6.2 

402578 C1001134:gi|2117372lplr|ll65981 fatty ac 6.2 

409646 AW161391 Hs.709 deoxycytkJine kinase 6.1 

25 430447 W17064 Hs.332848 SWI/SNF related, matrix associated, acti 6.1 

432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.1 

443709 AI082692 Hs.134662 ESTs • 6.1 

420929 AI694143 Hs.296251 programmed cell death 4 6.1 

428368 BE440042 Hs.83326 matrix metalloprotelnase 3 (stromelysin ' 6.1 

30 428248 AU26772 Hs.40479 ESTs 6.0 

420344 BE463721 Hs.97101 putative G protein-coupled receptor 6.0 

453392 U23752 Hs.32964 SRY (sex determining region Y}-box 11 6.0 

425397 J04088 Hs. 156346 topoisomerase (DNA) II alpha (170kD) 6.0 

418007 M13509 Hs.83169 matrix metalloprotelnase 1 (MMP1; inters 6.0 

3 5 428585 AB007863 Hs.185140 KIAA0403 protein 6.0 

437608 AA761605 Hs.292308 ESTs. Weakly similar to ALU 1 .HUMAN ALU S6.0 

427408 AA583206 Hs.2156 RAR-related orphan receptor A 6.0 

406687 M31126 Hs.272620 mafrix metaltoproteinase 11 (MMP11;sfro 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

40 447051 AW139130 Hs.160951 ESTs, WeaWy similar to Con1 (H.sapiens] 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

432239 X81334 Hs.2936 matrix metaUoproteinase 13 (collagenase 6.0 

435106 AA100847 Hs.193380 ESTs, Highly similar to AF 174600 1 F-box 5.9 

435525 AI831297 Hs.123310 ESTs 5.9 

45 458809 AW972512 Hs.20985 sin3-associated polypeptide, 30kD 5.9 
410785 AW803341 gb:IL24JMO079-09030O-05O%DO3 UMO079 Homo5.9 

422576 BE548555 Hs.118554 CGI-83 protein 5.9 

451398 AI793124 Hs.144479 ESTs 5.9 

441881 AW968904 Hs.179566 hypothetical protein FIJ22624 5.8 

50 412022 AI005043 Hs.24143 Wiskott-Aldrich syndrome protein interne 5.8 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxylic 5.8 

447350 A1375572 Hs.172634 ESTs 5.8 

434094 AA305599 Hs,238205 hypothetical protein PRO20 13 5.8 

409151 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. c 5.8 

55 448807 AI571940 Hs.7549 ESTs 5.8 

452281 T93500 Hs.28792 Homo sapiens cDNA FU11041 fis, clone PL 5.8 

421281 AI299139 Hs.17517 ESTs 5.8 

430361 AK333965 Hs.239926 sterol-C4-methy! oxidase-like 5.8 

400289 X07820 Hs.2258 matrix metaltoproteinase 10 (MMP10; str 5.7 

60 440527 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha 5.7 

434674 AA831879 Hs.136985 ESTs 5.7 

426320 W47595 Hs.1 69300 transforming growth factor, beta 2 5.7 

452401 NM_007115Hs.29352 tumor necrosis factor, alpha-Induced pro 5.7 

448663 BE614599 Hs.106823 hypothetical protein MGC1 4797 5.7 

65 438199 AW016531 Hs.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7 

428336 AA503115 Hs.163752 microseminoproteln, beta- 5.6 
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430379 AF134149 Hs.240395 
422835 BE218705 Hs.121378 
444758 AL044878 Hs.11899 
443426 AF098158 Hs.9329 
5 400301 X03535 Hs.1657 
447078 AW885727 Hs,301570 
432015 AL157504 Hs.159115 
438691 AA906288 Hs^12184 
439809 R41398 Hs.101774 
10 415786 AW419196 Hs.257924 
455373 BE247706 Hs.89751 
401645 NA 

437967 BE277414 Hs.5947 
445885 AI734009 Hs.127699 
15 439138 AI742605 Hs.193696 
440270 NM.015986Hs.7120 
437536 X91221 Hs.144465 
438167 R28363 Hs.24286 
452741 BE392914 Hs.30503 

20 426214 H59846 Hs.128355 
413554 AA319146 Hs.75426 
422867 L32137 Hs.1584 
434263 N34895 Hs.44648 
446382 AW205168 Hs.150823 

25 422406 AF025441 Hs.116206 
438321 AA576635 Hs.6153 
418310 AA814100 Hs.86693 
419625 U91616 Hs.91640 
450701 H39960 Hs.288467 

30 445900 AF070526 Hs.13429 
449051 AW961400 Hs.333526 
418758 AW959311 Hs.172012 
431070 AW408164 Hs.249184 
417079 U65590 Hs.81134 

35 421928 AF013758 Hs.109643 
423B04 AK000713 Hs.193736 
427427 AF077345 Hs.177936 
403485 

422168 AA586894 Hs.1 12408 
40 421937 AI878857 Hs.109706 
426752 X69490 Hs.172004 
453310 X70697 Hs.553 
423198 M81933 Hs.1634 
412281 AI810054 Hs.14119 
45 447513 AW955776 Hs.313500 
453931 AL121278 Hs.25144 
404347 

431808 M30703 Hs.270833 
429113 D28235 Hs.196384 

50 436291 BE568452 Hs.5101 
450603 R43646 Hs.12422 
434725 AK000796 Hs.4104 
435981 H74319 Hs,188620 
407376 AA993138 Hs.142287 

55 431689 AA305688 Hs.267695 
405348 NA 

436196 AK001084 Hs.333498 
437065 AL036450 Hs.103238 
410196 AI936442 Hs.59838 
60 429412 NM.006235Hs.2407 
446619 AU076643 Hs.313 
403329 NA 

442875 BE623003 Hs.23625 
442441 A1820662 Hs.129598 
65 430375 AW371048 Hs.93758 
424128 AW966163 
408873 AL046017 Hs.182278 



potassium channel subfamily K, member 6 5.6 
metal!olhioneln-like 5, {estis-specific 5.6 
3-hydroxy^^nethylgfuiaryK;oBnzyme A re 5.6 
chromosome 20 open reading frame 1 5.6 
estrogen receptor 1 5.6 
ESTs 5.6 
Homo sapiens mRNA; cDNA OKFZp586O0724 
ESTs 5.5 
hypothetical protein FU23045 5.5 
hypothetical protein FU13782 5.5 
membrane-spanning 4-domains, subfemflyA 5.5 
Cl6001440*:gij12330704Jgb)AAG52890.1jAF35.5 
mel transforming oncogene (derived from 5.5 
KIAA1603 protein 5.4 
ESTs 5.4 
cytokine receptor-like molecule 9 5.4 
ESTs 5.4 
ESTs 5.4 
Homo sapiens cDNA FU1 1344 fis, done PL 5.4 
ESTs, Moderately similar to ALU7_HUMAN A 5.4 
secretogranin II {chrarrwgranin C) 5.4 
cartilage oligomeric matrix protein (COM 5.4 
ESTs 5.4 
ESTs 5.4 
Opannteracting protein 5 5.3 
CGI-48 protein 5.3 
ESTs 5.3 
nuclear factor of kappa light polypeptid 5.3 
Homo sapiens cDNA FU12280 fis, clone MA 5.3 
Homo sapiens clone 24787 mRNA sequence 5.2 
HER2 receptor tyrosine kinase (c-erb-b2, 52 
hypothetical protein OKFZp434J037 5.2 
transcription factor 19 (SC1) 5.2 
interleukin 1 receptor antagonist 5.2 
pofyadenylate binding protein-interaciin 5.2 
hypothetical protein FU2Q7QQ 5.2 
ESTs 5.2 
C3001813*:gi|12737279|reflXPJ)12163.1| k 5.2 
S100 caicium-binding protein A7 (psorias 5. 1 
..... - , . . . . 5.1 

5.1 
5.1 
5.1 
5.1 



titin 

solute carrier family 6 (neurotransmitte 
cell division cycle 25A 
ESTs 

ESTs. Moderately similar to ALU7JHUMAN A 5.1 

5.1 
5.1 
5.1 
5.1 
5.1 
5.1 
5.0 
5.0 
5.0 
5.0 



ESTs 

Target Exon 

amphiregufin (schwannoma-derived growth 



protein regulator of cytokinesis 1 
ESTs 

hypothetical protein 
ESTs 

ESTs, Weakly similar to ALUF.HUMAN III! 
UOP-GakbetaGlcNAc beta 1,3-galactosyllr „, v 
C7001664:gi}12698061]db]jBAB21849.1l (AB 5.0 
Homo sapiens cDNA RJ10222 fis, done HE 5.0 
ESTs 5.0 
hypothetical protein FU10808 5.0 
POU domain, class 2, associating factor 5.0 
secreted phosphoprotein 1 (osteopontin, 4.9 
Target Exon 4.9 
Homo sapiens clone TCCCTA00142 mRNA sequ4.9 
ESTs 4,9 
H4 histone family, member H 4.9 
gb:EST378236 MAGE resequences, MAGI Homo4.9 
calmodutin 2 (phosphorylase kinase, dett 4.9 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.9 

432606 NM_002104Hs.3066 granzymeK (serine protease, granzyme 3; 4.9 

453204 R10799 Hs.191990 ESTs 4.8 

452020 AA722012 Hs.255757 ESTs, Weakly similar to AT2A_HUMAN POTEN4.6 

449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose Indue 4.8 

408369 R38438 Hs.182575 solute carrier family 15 (H??? transport 4.8 

431645 AF078849 Hs.266483 dynein Hghtchaln-A 4.8 

423575 C18863 Hs.163443 Homo sapiens cDNA FU1 1576 fis, clone HE 4.8 

444246 H93281 Hs.10710 hypothetical protein FU204 17 4.8 

421524 M312082 Hs. 105445 GDNF family receptor alpha 1 4.8 

452827 AI571835 Hs.55468 ESTs 4.8 

414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8 

456086 AL161999 Hs.77324 eukaryotic translation termination facto 4.8 

419078 M93119 Hs.89584 insulinoma-associated 1 4.8 

418973 AA233056 Hs.191518 ESTs 4.8 

447033 AI357412 Hs.157601 ESTs 4.8 

451621 AI879148 Hs.26770 fatty add binding protein 7, brain 4.7 

419968 X04430 Hs.93913 Interieukin 6 (Interferon, beta 2) 4.7 

424326 NM_014479Hs.145296 disintegrin protease 4.7 

431585 BE242803 Hs.262823 hypothetical protein FU 10326 4.7 

429294 AA095971 Hs.198793 Homo sapiens cONA: FU224 63 fis, done H 4.7 

416814 AW192307 Hs.80042 dolichyl-P-Glc:Man9GlcNAc2-PP-dQiichylgl 4.7 

439897 NM.015310Hs.6763 K1AA0942 protein 4.7 

429687 AI675749 Hs.211608 nudeoporin 153kD 4.7 

422880 AF228704 Hs.121524 glutathione reductase 4.7 



405801 



NM_000390:Homo sapiens crtoraideremia (Ra4.6 



432435 BE218886 Hs.282070 ESTs 4.6 

439544 W26354 Hs.28891 hypothetical protein FU 11 360; artemisp 4.6 

425354 U62:027 Hs.1 55935 complement component 3a receptor 1 4.6 

436027 AI864053 Hs.39972 ESTs, WeaWy similar to I38588 reverse t 4.6 

424623 AW963062 Hs.337404 ESTs 4.6 

403366 NA Target Exon 4.6 

402542 Target Exon 4.6 

450193 AI916071 Hs.15607 Homo sapiens Fanconl anemia complementat 4.6 

411678 AI907114 Hs.71465 squalene epoxidase 4.6 

456844 A1264155 Hs.152981 CDP-diacytglycerol synthase (phosphatida 4.6 

448072 AI459306 Hs.24908 ESTs 4.5 

408045 AW138959 Hs.245123 ESTs 4.5 

423782 A1472209 Hs.323117 ESTs 4.5 
447388 AW630534 Hs.76277 Homo sapiens, done MGC:9381, mRNA, comp4. 5 

448140 AF146761 Hs.20450 BCM-ltke membrane protein precursor 4.5 

452561 AI692181 Hs.49169 KIAA1 634 protein 4.5 
425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo4.5 

428801 AW277121 Hs.254881 ESTs 4.5 

428500 AI815395 Hs.184641 fattyadddesaturase2 4.5 

426075 AW513691 Hs.270149 ESTs, Weakly similar to 2109260A B cell 4.4 

437259 A1377755 Hs.120695 ESTs 4.4 

400409 AF153341 Hs.283954 Homo sapiens'winged helix/forkhead trans 4.4 

412863 AA121673 Hs.59757 zinc finger protein 281 4.4 

426989 A1815206 Hs.99395 ESTs 4.4 

401866 Target Exon 4.4 

418819 AA228776 Hs.191721 ESTs 4.4 

406348 Target Exon 4.4 

412138 AW895387 gb:QV4-NN0O38-30030O-157-c10 NN0038 Homo4.4 

428550 AW297880 Hs.98661 ESTs 4.4 
411743 AW862214 gb:QV4-CT0361-30l299-074-b05CT0361 Homo4.4 

429966 BE081342 Hs.283037 HSPC039 protein 4.4 

423291 NMJ)04129Hs.126590 guanylate cydase 1,sohible f beta2 4.4 
423456 AL1 10151 Hs.128797 DKF2P586D0824 protein 4.4 
452190 H26735 Hs.91668 Homo sapiens done PP1498 unknown mRNA 4.4 
424871 NM 004525HS.1 53595 low density Dpoproteirwelated protein 4.3 
429575 AA706003 Hs.99387 ESTs 4.3 
429922 Z97630 Hs.226117 H1 histone family, member 0 4.3 
421379 Y15221 Hs.1 03982 small inducible cytokine subfamily B (Cy 4.3 
400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 4.3 
437258 AL041243 Hs.174104 ESTs 4.3 
446595 T57448 Hs, 15467 hypothetical protein FU20725 4.3 
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403011 NA ENSP00000215330*:ProbabIeserine/lhreoni 4.3 

419055 AI365384 Hs.11571 Homo sapiens cDNAFLM 1570 fis, clone HE 4.3 

418661 NM.001949Hs.1189 E2Ftranscripfion factor 3 4.3 

407786 AA687538 Hs.38972 tetraspanl 4.3 

5 429183 AB014604 Hs.197955 K1AA0704 protein 4,3 

442914 AW188551 Hs.99519 hypothetical protein FU 14007 4.3 

441029 AI091795 Hs.179246 ESTs 4.3 

452194 AI694413 Hs.332649 olfactoiy receptor, family 2, subfamily 4.3 

414821 M63835 Hs.77424 Fc fragment of IgG.hgh affinity la. re 42 
10 410102 AW248508 Hs.279727 Homo sapiens cONA FU14035 6s, clone HE 4.2 
452110 T47667 Hs.28005 Homo sapiens cDNA FU1 1309 fis, done PL 42 

442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.2 

417318 AW953937 Hs.12891 ESTs 4.2 

431818 AW510444 Hs.191705 ESTs. Weakly similar to T471 84 hypotheti A2 

15 443646 AI085198 Hs.164226 ESTs 42 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.2 

446839 BE091926 Hs.16244 mitotic spindle coiled-co'il related prot 42 

423242 AL039402 Hs.125783 DEME-6 protein 4.2 

432116 AA902953 Hs.308538 ESTs 4.2 

20 409038 T97490 Hs.50002 small inducible cytokine subfamily A {Cy 4.2 

445625 BE246743 Hs.288529 hypothetical protein FU22635 4.2 

425139 AW630488 Hs.325820 protease, serine, 23 4.2 

447397 BE247676 Hs.18442 E-1 enzyme 4.2 

410166 AK001376 Hs.59346 hypothetical protein FU 1051 4 4.1 

25 437295 AW779318 Hs.88417 ESTs 4.1 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1 

441790 AW294909 Hs.132208 ESTs 4.1 

410129 BE244074 Hs.58831 regulator of Fas-induced apoptosis 4.1 

427521 AW973352 Hs.290585 ESTs 4.1 

30 425247 NNL005940HS.155324 matrix metaltoprotelnase 11 (MMP11;stro 4.1 

412886 AF041163 Hs.74647 Human T-ceB receptor active alpha-chain 4.1 
441153 BE562826 gb:601336534F1 NIH MGC 44 Homo sapiens c4.1 

444301 AK000136 Hs.10760 asporin (LRR dass 1) 4.1 

426711 AA383471 Hs.1 80669 conserved gene amplified in osteosarcoma 4.1 

35 405850 NA Target Exon 4.1 

440283 A1732892 Hs.190489 ESTs 4.0 

432441 AW292425 Hs.163484 ESTs 4.0 

400284 NA estrogen receptor 1 4.0 

417341 N91453 Hs.102987 ESTs 4.0 

40 429732 U20158 Hs.2488 lymphocyte cytosoGc protein 2 (SH2 doma 4.0 

411393 AW797437 Hs.69771 B-factor, properdin 4.0 

425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.0 

419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.0 

419092 J05581 Hs.89603 mucin 1, transmembrane 4.0 

45 443147 A1034351 Hs.19030 ESTs 4.0 

408633 AW963372 Hs.46677 PRO2000 protein 4.0 

433404 T32982 Hs. 102720 ESTs 4.0 

421506 BE302796 Hs.105097 thymidine kinase 1, soluble 4.0 

417900 BE250127 Hs.82906 COC20 (cell division cycle 20, S. cerevi 3.9 

50 414602 AW630088 Hs.76550 Homo sapiens mRNA; cONA DKFZp564B1264 (0.9 

413762 AW411479 Hs.848 FK506-binding protein 4 (59kO) 3.9 

404580 NMJ)14112*:Homo sapiens trfchorhlnophala 3.9 

452046 AB018345 Hs.27657 K1AA0802 protein 3.9 
459587 AA031956 gb:zk15e04.s1 Soares_pregnantuterus NbH3.9 

55 416658 U03272 Hs.79432 fibrillin 2 {congenital contracture! ara 3.9 

426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 3.9 

429353 AL1 17406 Hs.200 1 02 ATP-binding cassette transporter MRP8 3.9 

419038 AW134924 Hs.190325 ESTs 3.9 

416918 X07871 Hs.89476 C02 antigen (p50), sheep red blood cell 3.9 

60 421977 W94197 Hs.1 10165 ribosomal protein L26homolog 3.9 

442557 AI201183 Hs.130251 ESTs 3.9 

421168 AF182277 Hs.330780 cytochrome P450, subfamily II B (phenobar 3.9 
431701 AW935490 Hs. 14658 Human chromosome 5q13.1 clone 5G8 mRNA3.9 

418526 BE019020 Hs.85838 solute carrier family 16 (monorarboxylic 3.9 

65 414998 NMJ»2543Hs.77729 oxidised low density lipoprotein (lectin 3.9 

422790 AA809875 Hs.25933 ESTs 39 

419741 NMJW7019HS.93002 ubiquitin carrier protein E2-C 3*9 

327 



WO 02/059377 PCT/US02/02242 



430017 AA263172 Hs.35 protein tyrosine phosphatase, non-recept 3,9 

458814 AI498957 Hs.170861 ESTs, Weakly similar to Z195JMMAN ZINC 3.8 

428514 AW236861 Hs.193139 ESTs 3.8 

434521 NM_002267Hs.3886 karyopherin alpha 3 (importin alpha 4) 3.8 

5 409425 U40462 Hs.54452 zinc finger protein, subfamily 1 A, 1 (Ik 3.8 

439560 BE565647 Hs.74899 hypothetical protein FU 12820 3.8 

424028 AF055084 Hs.153692 Homo sapiens cDNA FU14354 lis, done Y7 3.8 

400021 AFFX control - HUM1SGF3A/M97935_MA 3.8 

453403 BE466639 Hs.61779 Homo sapiens cDNAFU 13591 lis, done PL 3.8 

10 445941 AI267371 Hs.172636 ESTs 3.8 

434378 AA631739 Hs.335440 EST 3.8 

429220 AW207206 Hs.136319 ESTs 3.8 

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 Gne-1 pr 3.8 

401045 C11001883 4 :gi|6753278|retlNP_033938.1|c 3.8 

15 430178 AW449612 Hs.152475 ESTs 3.8 

423397 NM_001838Hs.1652 chemokine (C-C motif) receptor 7 3.8 

447630 AI660149 Hs.44865 lymphoid enhancer-oinding factor 1 3.8 

436391 AJ227892 Hs.146274 ESTs 3.8 

413011 AW068115 Hs.821 blglycan 3.8 

20 422121 AI767949 Hs.179833 ESTs 3.8 

452268 NMJ)03512Hs.28777 H2A histone family, member L 3.8 

427811 M81057 Hs.180884 carboxypeptldase B1 (tissue) 3.8 

415579 AA165232 Hs.222069 ESTs 3.8 
437330 AL353944 Hs.50115 Homo sapiens mRNA;cDNADKF2p761J11 12 (f 3.8 

25 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 3.7 

400286 NA C16000922:gi|7499103|plr|IT20903hypothe 3.7 

420281 A1623693 Hs.191533 ESTs 3.7 

419926 AW900992 Hs.93798 DKFZP586D2223 protein 3.7 

417541 AJ992191 Hs.180040 hypothetical protein FU22439 3.7 

30 426172 AA371307 Hs.125056 ESTs 3.7 

429638 AI916662 Hs,21 1577 kinectin 1 (kinesln receptor) 3.7 

457001 J03258 Hs.2062 vitamin D (1,25- dihydroxyvitamin D3) re 3.7 
424109 AW406878 gb:UI-HF-BL0-adg^-06-04Jl.r1 N1HJAGC.373.7 

417022 NM_014737Hs.80905 Ras association (RalGDS/AF-6) domain fam 3.7 

35 436222 A1208737 Hs.122810 Homo sapiens cONA FU 11489 fls, clone HE 3.7 

430448 AI633553 Hs.13303 Homo sapiens cONA: FU21784 fis, clone H 3.7 

432729 AK000292 Hs.278732 hypothetical protein FU20285 3.7 

413916 N49813 Hs.75615 apolipoprotein C-ll 3.7 

421662 NMJ)14141Hs.1Q6552 cell recognition molecule Caspr2 3.7 

40 441633 AW958544 Hs.112242 norma) mucosa of esophagus spedficl 3.7 

408761 AA057264 Hs.238936 ESTs, Weakly similar to (defline not ava 3.7 

406153 Target Exon 3.7 

445563 AW873606 Hs.149006 ESTs 3.7 

453464 AI884911 Hs.32989 receptor (calcitonin) activity modifying 3.7 

45 448918 AB011152 Hs.22572 KIAA058O protein 3.7 

413936 AF113676 Hs.297681 serine (or cysteine) proteinase inhiblto 3.6 

448069 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.6 

453313 BE005771 Hs.153746 hypothetical protein FU22490 3.6 

425234 AW152225 Hs.165909 ESTs, WeaWy simflar to 138022 hypotheti 3.6 

50 419941 X98654 Hs.93837 phosphatidylinositol transfer protein, m 3.6 

402397 AF188625 Hs.169507 phosphofipase A2, group !(0 3.6 

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfaml 3.6 

448106 A1800470 Hs.171941 ESTs 3.6 

426431 NM_00(M58Hs.169853 transcription factory hepatic; LF-83; 3.6 

55 431843 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypotheti 3.6 
426878 BE069341 gb:QV3-BT0381-27010(W)73<08BT0381 Homo3.6 

434061 AW024973 Hs.283675 NPD009 protein 3.6 
445292 AV653264 Hs.13982 Homo sapiens cONA FU1 4666 fls, done NT 3.6 
452101 T60298 Hs.10844 Homo sapiens cDNA FU14476 fis, done MA 3.6 

60 427581 NM014788HS.179703 KIAA0129 gene product 3.6 

409047 AW961434 Hs.31539 ESTs 3.6 

416820 NMJXW402Hs.80206 g(ucose-6-phosphate dehydrogenase 3.6 

410386 W26187 Hs.3327 Homo sapiens cONA: FU22219 fis, done H 3.6 

440516 S42303 Hs.161 cadherin 2, type 1,N-cadherin (neuronal 3.6 

65 434360 AW015415 Hs.127780 ESTs 3.6 

428970 BE276891 Hs.194691 retinolc add induced 3 3.6 

415079 R43179 Hs.22895 hypothetical protein FU23548 3.6 
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409519 AK001015 Hs.55220 BCL2-associated athanogene 2 3.6 
430044 AA464510 Hs.152812 ESTs 3.6 
430829 AW451999 Hs.194024 ESTs 3.6 
434224 AA380731 Hs.84 Interieukin 2 receptor, gamma (severe co 3.6 
5 439247 AF088020 Hs.46767 EST 3.6 
431542 H63010 Hs.5740 ESTs 3.5 
430713 AA351647 Hs.2642 eukaryotic translation elongation factor 3.5 
434988 AI418055 Hs.161160 ESTs 3.5 
437748 AF234882 Hs.5814 suppression of to/norigenicity 7 3.5 

10 418322 AA284166 Hs.84113 cycQn-dependent kinase inhibitor 3 (CDK 3.5 
439569 AW602166 Hs.222399 CE6P1 protein 3.5 
459583 AI907673 gb:ll.-BT152-Q80399-004 BT152 Homo sapien3.5 

403212 NMJ)19595:Homo sapiens intersecfin 2 (IT 3.5 

409099 AKD00725 Hs.50579 hypothetical protein FU20718 3.5 

15 453968 AA847843 Hs.62711 Homo sapiens, clone IMAGE3351295, mRNA3.5 

436338 W92147 Hs.1 18394 ESTs 3.5 

422890 Z43784 Hs.75893 ankyrin 3,nodeofRanvier(ankyrinG) 3.5 
442295 AI827248 Hs.224398 Homo sapiens cONA FU1 1469 fis, clone HE 3.5 

417975 AA641836 Hs.30085 hypotheticaJ protein FU23 186 3.5 

20 433730 AK002135 Hs.3542 hypothetical protein RJ1 1273 3.5 

439926 AW014875 Hs.137007 ESTs 3.5 

• 445873 AA250970 Hs.251946 pc4y(A}-bin(ling protein, cytoplasmic 1-1 3.4 

410153 BE311926 Hs.15830 hypothetical protein FU12691 3.4 
422128 AW881145 gb:QVO-OT0033^)10400-182-a070T0033Homo 3.4 

25 414921 BE390551 H$.77Q2& steroidogenic acute regulatory protein r 3.4 

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 3.4 

444564 AI167877 Hs.143716 ESTs 3.4 

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 

30 422414 AW875237 Hs.13701 ESTs 3.4 

433345 AI681545 Hs.152982 hypotheticaJ protein FU1 31 17 3.4 

409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4 

422611 AA158177 Hs.118722 fucosyltransferase 8 (alpha (1,6) fucosy 3.4 

423554 M90516 Hs.1674 gliitarrtofrudose-6^nosphatetransamin 3.4 

35 442619 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 prate 3.4 
402359 NA Cl9001991^il12656111Igb|AAK00751.1|AF23.4 

439398 AA284267 Hs.221504 ESTs 3.4 

415208 F01020 Hs.172004 tifin 3.4 

452853 AA812633 Hs.10845 ESTs 3.4 

40 429345 R11141 Hs.199695 hypothetical protein 3.4 

449027 AJ271216 Hs.22880 dipeptidylpeptidase 111 3.4 

412115 AKD01763 Hs.73239 hypothetical protein FU10901 3.4 

432180 Y18418 Hs.272822 RuvB (E cofi homo!og)-like 1 3.4 

428977 AK001404 Hs.194698 cyclin B2 3.4 

45 431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4 
418286 AA622528 Hs.319825 Homo sapiens, done IMAGE:3616574, mRNA,3.4 

436895 AF037335 Hs.5338 carbonic anhydrase XII (tumor antigen H 3.4 

443378 AW392550 Hs.9280 proteasome (prosome, macropain) sub-unit, 3.4 

428450 NWL014791HS.184339 KIAA0175 gene product 3.3 

50 449571 AW016812 Hs.200266 ESTs 3.3 

412777 AI335773 Hs.270123 ESTs 3.3 

420542 NM.000505Hs.1321 coagulation factor XII (Hageman factor) 3.3 

412754 AW160375 Hs.74565 amyloid beta (A4) precursor-like protein 3.3 

418327 U70370 Hs.84136 paired-like homeodomaln transcription fa 3.3 

55 449065 A1627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3 

425999 AW513051 Hs.332981 ESTs, Weakly similar to 138022 hypotheti 3.3 

430280 AA36125B Hs.237868 interieukin 7 receptor 3.3 

407777 AA161071 Hs.71465 squatene epoxidase 3.3 

426516 BE262660 Hs.1 70197 glutamic-oxaloacetic transaminase 2, m'rt 3.3 

60 414361 AI086138 Hs.204044 ESTs 3.3 

427080 AW068287 Hs.1 73466 ras-related C3 botuBnum toxin substrate 3.3 

426429 X73114 Hs.169849 myosin-bindtog protein C, slow-type 3.3 

446163 AA026880 Hs.25252 prolactin receptor " 3.3 

428566 U41763 Hs.184916 clathrin, heavy polypeptkJe-like 1 3.3 

65 418641 BE243136 Hs.86947 a disintegrin and metailoproteinase doma 3 3 

436293 AI601188 Hs.120910 ESTs 33 

411257 AA628967 Hs.115274 ESTs, Highly similar to IHH_HUMAN INDIAN 3.3 
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430253 AK001514 Hs.236844 hypothetical protein FU10552 3.3 
430066 A1929B59 Hs.237825 signal recognition particle 72kD 3.3 
436469 AK001455 Hs.5198 Down syndroms critical region gene 2 3.3 
437786 BE142681 Hs.155573 polymerase (DNA directed), eta 3.3 
444079 H09048 Hs.23606 ESTs 3.3 
457183 H91882 Hs.118569 Dvl-binding protein IDAX (Inhibition of 3.3 
431215 AA496078 Hs.121554 Human DNA sequence from clone RP 11-21 8C 13. 3 
424563 AA446932 Hs.1 51428 ret finger protein 2 3.3 
450828 AW270655 Hs.193804 ESTs 3.3 
408652 R43409 Hs.6829 Homo sapiens mRNA for KIAA1 644 protein, 3.3 
445142 AW978484 Hs.93842 Homo sapiens cDNA: FU22554 lis, clone H 3.3 
426761 A1015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586l2022 (13.3 
439237 AW408158 Hs.318893 ESTs. Weakly similar to A47582 B-ceU gr 3.3 
422616 BE300330 Hs.1 18725 selenophosphate synthetase 2 3.3 
443247 BE614387 Hs.333893 c-Myc target JP01 3.3 
406663 U24683 Hs.302063 immunoglobulin heavy constant mu 3.3 
434137 AA907734 Hs.124895 ESTs 3.3 
408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 3.3 
439101 C01765 Hs.38750 hypothetical protein FU 11 526 3.3 
408221 AA912183 Hs.47447 ESTs 3.3 
447519 U46258 Hs.339665 ESTs 3.3 
404755 NA Target Exon 3.3 

451871 AI821005 Hs.1 18599 ESTs 3.2 
420319 AW406289 Hs.96593 hypothetical protein 3.2 
430580 AA806105 Hs.300697 immunogbbulin heavy constant gamma 3 (G 3.2 
40Q202 NA NfVL002795 # :Homo sapiens proteasome (pros32 

400222 NA NM_002082 t :Homo sapiens G protein-couple 3.2 

425988 BE045897 Hs.274454 ESTs, Weakly similar to I38022 hypotheti 3.2 
458098 BE550224 Hs.74170 metaltothioneinlE (functional) 3.2 
430589 AJ002744 Hs.246315 UDP-N-acetyl^alpha-D-galactosamine:polyp 3.2 
431563 AI027643 Hs.120912 ESTs 3.2 
442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7JHUMAN A 3.2 
422309 U79745 Hs. 11 4924 solute canier family 16 {monocarboxylic 3.2 
419703 AI793257 Hs.128151 ESTs 3.2 
420380 AA640891 Hs.102406 ESTs 3.2 
410853 H04588 Hs.30469 ESTs * 3.2 

454417 AI244459 Hs.1 10826 trinucleotide repeat containing 9 3.2 
432745 AI821926 gbmt78(05.x5 NCLCGAP_Pr3 Homo sapiens 3.2 

422032 AA476966 Hs.1 10857 polymerase (RN A) III (DNA directed) poly 3.2 
415339 NM.015156HS.78398 KIAA0071 protein 3.2 
426384 AI472078 Hs.303662 ESTs 3.2 
448030 N30714 Hs.325960 membrane-spanning 4-domains, subfamily A 3.2 
418739 AA3109S4 Hs.88012 SHP2 interacting transmembrane adaptor 3.2 
442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G 163.2 
434747 AA837085 Hs.220585 ESTs 3 ; 2 
427297 AW292593 Hs.334907 Homo sapiens, clone MGC:17333, mRNA, com3.2 
412228 AW503785 Hs.73792 complement component (3aVEpstein Barrvi 3.2 
452304 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.2 
453953 AW408337 Hs.36972 CD7 antigen (p41) 3.2 
407758 D50915 Hs.38365 KIAA0125 gene product 3.2 
451149 AL047586 Hs.10283 RNA binding motif protein 8B 3.2 
430015 AW768399 Hs.1 12157 ESTs « 
433313 W20128 Hs.296039 ESTs « 
418334 AA319233 Hs.5521 ESTs ** 
450223 AA418204 Hs.241493 natural kllier-Uimor recognition sequenc 3.2 
454365 AW966728 Hs.54642 methionine adenosyltransferase II, beta 3.2 
451128 AL118668 gb:DKFZp761 I0310_r1 761 (synonym: hamy2)3.2 

417793 AW405434 Hs.82575 small nuclear ribonucleoprotein polypept 3.2 
428027 U22029 Hs.334345 cytochrome P450, subfamily IIA (phenobar 3.2 
441197 BE244638 Hs.166 sterol regulatory element binding transc 3.2 
424634 NMJ)03613Hs.151407 cartilage Intermediate layer protein, nu 3.2 
419986 AI345455 Hs.78915 GA-binding protein transcription factor, 3.2 
416714 AF283770 Hs.79630 CD79A antigen (trnmunoglobulin-associated 3.2 
449465 NM_004380Hs.23598 CREB binding protein (Rubinstein-Taybi s 3.2 
422166 W72424 Hs.112405 S1 00 calcium-binding protein A9 (calgran 3.2 
409079 W87707 Hs.82065 interleukin 6 signal transducer (gp1 30, 3.2 
423551 AA327598 Hs.233785 ESTs 3.2 
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453553 AA036849 Hs.61829 Homo sapiens cDNA FU12763 fis, clone NT 3.2 

442580 AI733682 Hs.130239 ESTs 3.2 
458079 AI79687Q Hs.54277 DNA segment on chromosome X (unique) 9923.2 

425700 AF076292 Hs.159251 forkheadboxHI 3.2 

5 417124 BE122762 Hs.25338 ESTs 3.2 

407104 S57296 Hs.323910 v-erb-©2 avian erythroblastic leukemia v 3.2 

442215 AI703172 Hs.129005 ESTs, Weakly similar to 2109260A B cell 3.1 

430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1 

425317 AW205118 Hs.210548 Interleukin 21 receptor 3.1 

10 426095 AI278023 Hs.89986 ESTs 3.1 

442313 BE388898 Hs.8215 hypothetical protein FU 11 307 3.1 

424709 AL137589 Hs.152149 hypothetical protein DKF2p434K0410 3.1 

429671 8E379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1 

432715 AA247152 Hs.200483 ESTs, Weakly similar to K1AA1 074 protein 3.1 

15 431574 AW572659 Hs.261373 hypothetical protein dJ434014.3 3.1 

436875 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 3.1 

405017 NA Target Exon 3.1 

433805 AA706910 Hs.1 12742 ESTs 3.1 

437352 AL353957 Hs.284181 hypothefical protein DKFZp434P0531 3.1 

20 430105 X70297 Hs.2540 cholinergic receptor, nicotinic, alpha p 3.1 

422083 NNL.001141HS.1 11256 arachidonate 15-lipoxygenase, second typ 3.1 

413507 BE145360 Hs.190064 ESTs, Weakly simitar to 138022 hypolheli 3.1 

415989 A1267700 Hs.317584 ESTs - 3.1 

422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1 

25 425548 AA890023 Hs.1906 prolactin receptor 3.1 

422599 BE387202 Hs.1 18638 non-metastatic cells 1, protein (NM23A) 3.1 

439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1 

453883 A1638516 Hs.22630 cofactor required for Sp1 transcripfiona 3.1 

458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protei 3.1 

30 418478 U38945 Hs,1174 cycfin-dependent kinase inhibitor 2A (me 3.1 

400814 NA Target Exon 3.1 

402327 Target Exon 3.1 

416935 M190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1 

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1 

35 437035 A1571514 Hs.133022 ESTs 3.1 

449523 NM_000579Hs.54443 chemokine (OC motif) receptor 5 3.1 

406642 AJ245210 gb:Homo sapiens mRNA for Immunoglobulin 3.1 

408624 AF052762 gb:Homo sapiens clone csneg8-1 immunoglo 3.1 

421924 BE514514 Hs.109606 coronin, acfcn-binding protein, 1A 3.1 

40 414523 AU076633 Hs.76353 serine (or cysteine) proteinase inhibito 3.1 

416379 N38857 Hs.203933 ESTs 3.1 

422823 D89974 Hs.121102 vanin2 3.1 

433904 AI399956 Hs.208956 ESTs 3.1 

421904 BE143533 Hs.109309 hypothetical protein FU20035 3.1 

45 428834 AW899713 Hs.339315 ESTs 3.1 

436043 AW963838 Hs.168830 Homo sapiens CDNAFU1 2136 fis, done MA 3.1 

452823 AB012124 Hs.30696 transcription factor-Oka 5 (basic helix 3.1 

405381 NA Target Exon 3.1 

428746 AW503820 Hs.1 92861 Spi-B transcription factor (Spi-l/PU.1 r 3.1 

50 435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f3.1 

425782 U66468 Hs.1 59525 cell growth regulatory with EF-handdoma 3.1 

423306 W88562 Hs.108198 ESTs 3.1 

419123 AA234276 Hs.88253 ESTs 3.1 

438581 AW977766 Hs.292133 ESTs, Moderately similar to 178885 serin 3.1 

5 5 417105 X60992 Hs.81226 CD6 antigen 3.0 

428361 NM_015905Hs. 183858 transcriptional Interniediary factor 1 3.0 

417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0 

402606 NM 024626:Homo sapiens hypothetical prot 3.0 

401451 NfVL004496^omo sapiens hepatocyte nude 3.0 

60 421876 AA299652 Hs.1 11496 Homo sapiens cDNA FU1 1643 fis, clone HE 3.0 

409518 BE384836 Hs.3454 KIAA1 821 protein 3.0 

416933 BE561850 Hs.80506 srnafl nuclear ribonudeoprotein porypept 3.0 

414324 Y14768 Hs.890 rymphotoxin beta (TNF superfamDy, membe 3.0 

425081 X74794 Hs.154443 mir^romosorne maintenance deficient (S. 3.0 

65 401519 C15000476*:gi|12737279]refpCPjD12163.1 1 3.0 

411704 AI499220 Hs.71573 hypothetical protein FU 10074 3.0 

428819 AL135623 Hs.193914 WAAQ575 gene product 3.0 
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428423 AU076517 Hs.184276 solute carrier family 9 {sodium/hydrogen 3.0 
413835 M272727 Hs.249163 fatty add hydroxylase 3.0 
412600 L28824 Hs.74101 spleen tyrosine kinase 3.0 
410491 AA465131 Hs.64001 Homo sapiens clone 25218 mRNA sequence 3.0 
5 433658 L03678 Hs.156110 Immunoglobulin kappa constant 3.0 
427666 AI791495 Hs.180142 catmodufin-iike skin protein 3.0 
452514 AI904898 gb:RC-BT068- 1 30399-085 BT068 Homo sapien3.0 

429500 X78565 Hs.289114 hexabrachion{tenascinC,cytotactin) 3.0 
432485 N90866 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 3.0 

10 437400 AB011542 Hs.5599 EGF4ike-domain, multiple 5 3.0 
452234 AW084176 Hs.223296 ESTs. Weakly similar to I38022 hypotheti 3.0 
413269 BE167526 gb:CM4-HT0509-080300-1 07-g07 HT0509 Homo 3.0 

453216 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 3.0 
400929 ENSP00000252232*:Sterol regulatory eleme 3.0 

15 445145 AI961702 Hs.147434 ESTs 3.0 
432615 AA557191 Hs.55028 ESTs, WeaWy similar to 154374 gene NF2 3.0 
423279 AW959861 Hs.290943 ESTs 3.0 
429392 AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 3.0 
408548 AA055449 Hs.63187 ESTs, Weakly similar to ALUC JiUMAN till 3.0 

20 451346 NM_006338Hs.26312 glioma amplified on chromosome 1 protein 3.0 
413109 AW389845 Hs.1 10855 ESTs 3.0 
401714 NA ENSP000a0241802*:CDNAFU11007FIS,CLON 3.0 

421462 AF016495 Hs.104624 aquaporin9 3.0 
421750 AK000768 Hs.107872 hypothetical protein FU20761 3.0 

25 453293 AA382267 Hs.10653 ESTs 3.0 
457085 AA412446 Hs.98138 ESTs 3.0 
438930 AW843633 Hs.306163 hypothetical protein AL1 1011 5 3.0 
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TABLE 22A 



Table 22A shows the accession numbers for those pkeys lacking unigenelD's for Table 22. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



407980 103087.1 AA046309 A1263500 AA046397 

410785 1221055.1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

411743 1256098 1 AW862214 AW859811 AW862215 

412138 1279172.1 AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538 

413269 1356961 1 BE167526 BE1 67651 8E076401 R24654 

416935 163179J AA190712AA190665AA252564 

422128 211994.1 AW881145 AA490718 M85637AA304575 T06067 M331991 

423945 233566 1 AA410943 AW948953 AA334202 AA332882 

424109 235506.1 AW406878 AW966560 AW966151 AW966496 AA336174 AA335376 AA335537 

424128 235728 1 AW966163AA335983AA336011 AA335668 AA335973 

425331 250199.1 AW962128 AA355353 AA427363 

426878 273265 1 BE069341 AW748403 AL044891 A1908240 AA393080 

432745 353673 1 AI821926 AA658826 AA564492AA635129 AI791191 

441153 51084.2 BE562826 BE378727 

448212 755099J AI475858 AW969013 

451128 859865.1 AL1 18668 D78823 AI762176 

452514 920172.1 AI904898 AI904849 AI904899 

456207 165078_-1 AA193450 
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TABLE 22B 



Table 22B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



30 



35 



40 



45 



Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. "Dunham I.etaL refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham 1. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which axons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 


Strand 


NLposition 


400814 8569925 


.Minus 


72840-72924,74761-74849 


400929 7651921 


Minus 


122033-122241 , 123483-124028 


401045 8117619 


Plus 


90044-90184,91111-91345 


401451 6634068 


Minus 


119926-121272 


401519 6649315 


Plus 


157315-157950 


401645 7657839 


Minus 


34986-35133 


401714 6715702 


Pius 


96484-96681 


401666 6018106 


Plus 


73126-73623 


402327 7656695 


Minus 


108675-108770.109801-109910 


402359 9211204 


Minus 


4040341961 


402408 9796239 


Minus 


110326-110491 


402470 9797107 


Plus 


195129-195776 


402542 9801558 


Minus 


67076-67594 ■ 


402578 9884928 


Plus 


6635M6496 


402606 9909429 


Minus 


81747-82094 


403011 6693597 


Minus 


3468-3623 


403212 7630897 


Minus 


156037-156210 


403329 8516120 


Plus 


96450-96598 


403366 8783692 


Minus 


49323-49652 


403485 9966528 


Plus 


2888-3001,3198-3532,36554117 


404347 9838195 


Plus 


74493-74829 


404580 6539738 


Minus 


240588-241589 


404755 7706327 


Minus 


53729-53846 


405017 6532084 


Plus 


35551-35690 


405348 2914717 


Minus 


4331043462 


405381 6006920 


Minus 


7636-8054 


405801 2924321 


Plus 


63469-63694 


405850 6164995 


Plus 


13871-14110 


406153 9929734 


Minus - 


12902-13069 


406348 9255985 


Minus 


71754-71944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

5 Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for 12 non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90 th percentile value amongst the 12 non-malignant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4.0 
10 (i.e. 4-fold down-regulated in tumor vs. normal breast). 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 5 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of 50 th percentile norma! body tissue to 75* percentile tumor 



20 Pkey ExAccn UnigenelD UnigeneTiUe Ratio 

428722 U76456 Hs. 190787 tissue inhibitor of metalloprotelnase 4 22.4 

428848 NMJJ00230 Hs.194236 teptin (murine obesity homolog) 17.4 

445263 H57646 Hs.42586 KIAA1 560 protein 15.4 

25 418935 T28499 Hs.89485 carbonic anhydrase IV 15.0 

407228 M25079 Hs.155376 hemoglobin, beta 14.6 

417511 AL049176 Hs.82223 chordin-Cke 14.6 

439285 AL133916 Hs.172572 hypothetical protein FU20093 14.3 

412442 A1983730 Hs.26530 serum deprivation response (phosphatidyl 13.6 

30 410544 A1446543 Hs.95511 ESTs 12.6 

412047 AA934569 Hs.49696 ESTs 12.2 

422667 H25642 Hs.133471 ESTs 12.0 

406664 L34041 Hs.9739 grycerol-3-phosphate dehydrogenase 1 (so 12.0 

423201 NM0Q0163 Hs.125180 growth hormone receptor 11.7 

35 422163 AF027208 Hs.1 12360 prominin (mouse)-like 1 10.8 

428769 AW207175 Hs.106771 ESTs 10.6 

407049 X72632 NM_021724*:Homo sapiens nuclear receptor 10.1 

412295 AW088826 Hs.1 17176 poly(A)-binding protein, nuclear 1 9,8 

425126 N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 9.8 

40 406791 A1220684 Hs.272572 hemoglobin, alpha 2 9.5 

447471 AF039843 Hs.18676 sprauty (Drosophfla) homolog 2 9.5 

451533 NM 004657 Hs.26530 serum deprivation response (phosphatidyl 9.4 

419407 AW410377 Hs.41502 hypothetical protein FU21 276 9.0 

411939 AI365585 Hs.146246 ESTs 9.0 

45 410532 T53088 Hs.155376 hemoglobin, beta 8.9 

425707 AF1 15402 Hs.1 1713 E74-like factor 5 (ets domain transcript 8.8 

416585 X54162 Hs.79386 leiomodin 1 (smooth muscle} 8.7 

443060 D78874 Hs.8944 procollagen C^endopepBdase enhancer 2 8.6 

435265 AA779958 Hs.185932 ESTs 8.5 

50 422511 AW764A2 Hs.1 17938 collagen, type XVJI, alpha 1 8.4 

433138 AB029496 Hs.59729 semaphorin sem2 8.3 

402195 NM_004497*:Homo sapiens hepatocyte nude 8.1 

429350 AI754634 Hs.131987 ESTs 8.1 

445107 AI208121 Hs.147313 ESTs.Wealuyslrnnartol38022hypo1heti 8.1 

55 406643 N77976 Hs.272572 hemoglobin, alpha 2 8.0 

410199 AW377424 Hs.205126 Homo sapiens cONA: FU22667 fis. done H 8.0 

417225 AA815048 Hs.24078 hypothetical protein FU 12649 7.8 

437569 AA760849 Hs.294052 ESTs 7.5 

436062 AK000027 Hs.98633 ESTs 7.5 

60 425078 NM.002599 Hs.154437 phosphodiesterase 2A, cGMP-stimulated 7.5 

430327 AW973636 Hs.55931 ESTs 7.4 
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447577 AI393693 Hs.183297 OKFZP556F2124 protein 7.4 

446039 A1150491 Hs.90756 ESTs 7.2 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 7.2 

424455 AA452006 Hs.333199 ESTs 7.1 

424343 AW956360 Hs.4748 adenylate c/daso activating polypeptide 7.1 

442792 A1352340 Hs.131194 ESTs 7.0 

406714 AI219304 Hs.283103 hemoglobin, gamma G 6.9 

407571 AI446183 Hs.9572 ESTs, Highly similar to CYA5_HUMAN ADENY 6.8 

429580 AA346839 Hs.209100 DKFZP434C171 protein 6.7 

453500 A1478427 Hs.43125 esophageal cancer related gene 4 protein 6.7 

422233 AB002058 Hs.113275 purineraic receptor P2X-l3ce1, orphan r 6.7 

420205 AA256395 Hs.88156 ESTs 6.6 

404368 NA ENSP00000241075*:TRRAP PROTEIN. 6.6 

447261 NMJ006691 Hs.17917 extracellular link domain-containing 1 6.5 

417090 AA193282 Hs.85863 ESTs, Weakly similar to 834612 zinc Cng 6.5 

410677 NNL003278 Hs.65424 tetranectin (plasminogen-blnding protein 6.5 

427283 AL1 19796 Hs.174185 ectonucleotidepyrophosphatase/phosphodi 6.5 

415011 AW963G85 g b:EST375 158 MAGE resequences. MAGH Homo6.4 

412068 S72043 Hs.73133 metallothionein 3 (growth inhibitory fac 6.4 

416253 BE250659 Hs.15463 Homo sapiens, clone IMAGE:2959994, mRNA 6.4 

435B85 AA701483 Hs.36341 ESTs 6.3 

402779 NA Target Exon 6.3 

418138 AA213626 Hs.136204 EST 6.3 

439335 AA742697 Hs.62492 ESTs, Weakly similar to B390 66 proline-r 6.3 

427019 AA001732 Hs.173233 hypothetical protein FU10970 6.2 

411478 BE143068 gb:MR0-HT0158-030200403-bQ9HT0158Homo6.2 

452654 BE004783 gb:MR2-BN01 14-270400-004-e1 1 BN01 14 Homo6.1 

447359 NM_012093 Hs.18268 adenylate kinase 5 6.1 

414323 NM.014759 Hs.334688 K1AA0273 gene product 6.1 

441266 H15968 Hs.293845 Homo sapiens, done IMAGE 3502329, mRNA, 6.1 

417011 F08212 Hs.234898 ESTs, Weakly similar to 2109260A B cell 6.0 

400089 NA Eos Control 6.0 

433614 W07475 Hs.277101 cytochrome c oxidase subunit IV isoform 5.9 

440439 N92818 Hs.64754 ESTs, Weakly similar to potential COS [H 5.9 

454404 BE067414 gb: MR4-BTQ355-2001 00-201 -eO5BT0355 Homo 5.9 

436704 AA062610 Hs.148050 EST 5.9 

406563 NA Target Exon 5.9 

433490 AW451023 Hs.65848 hypothetical protein DKFZp7610132 5.9 

419313 AA843387 Hs.87279 ESTs 5.9 

409196 NfvL001874 Hs.334873 carboxypeptklase M 5.8 

410882 AW809163 gb:MR4-ST0118-261099-012-a03ST0118Homo5.8 

453469 AB014533 Hs.33010 KIAA0633 protein 5.8 

441899 AI372588 Hs.8022 TU3A protein 5.8 

426210 AA372052 Hs.334559 Homo sapiens cONA FU 14458 fis. clone HE 5.8 

413065 BE063555 gb:CM1-BT0283-O81199-033-d09BT0283 Homo 5.8 

454192 AW876813 Hs.3343 phosphoglycerate dehydrogenase 5 J 

425187 AW014486 Hs.22509 ESTs 5.7 

429757 AW452355 Hs.256037 ESTs 5.7 

420202 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5.7 

416284 AI695473 Hs.298006 ESTs 5.7 

428553 AA181641 Hs.184907 G protein-coupled receptor 1 5.6 

404689 NA Target Exon 5.6 

438887 R68857 Hs.265499 ESTs 5.6 

406082 S47833 Hs.82927 adenosine monophosphate deaminase 2 (iso 5.6 

449748 H23963 Hs.32043 ESTs 5.6 

431048 R50253 Hs.249129 cell death-inducing OFFA-like effector a 5.5 

452205 C15819 gb:C15819Clontech human aorta pdyAmRN 5.5 

434040 AW444613 Hs.288809 hypothetical protein RJ20159 5.5 

407744 AB020629 Hs.38095 ATP-binding cassette, sub-family A (ABC1 5.5 

450606 AI668605 Hs.60380 ESTs, Moderately similar to ALU6_HUMAN A 5.5 

414629 AA345824 Hs.76688 carboxylesterase 1 (monocyte/macrophage 5.5 

401665 C1 1000703:gi|1OO48448|ref|NP,O65258.1| g 5.5 

436107 T99079 Hs.191194 ESTs 5.5 

444432 AI161428 Hs.75916 splicing factor 3b, subunit 2, 145kD 5.5 

434715 BE005346 Hs.116410 ESTs 5.5 

447205 BE617015 Hs.11006 ESTs, Moderately similar to T17372 plasm 5.5 

408122 AI432652 Hs.42824 hypothetical protein FU1071 8 5.5 
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454016 AW016805 


Hs.233108 


ESTs 


5.5 


414913 R25621 




gb:yh45K)6.r1 Soares placenta Nb2HP Homo 


5.4 


459033 AA017590 


Hs.129907 


ESTs 


5.4 


441003 BE172240 


Hs.126379 


ESTs, Weakly similar to 138022 hypotheti 


5.4 


450637 N49826 


Hs.18602 


ESTs 


5.4 


442398 AA994520 




gb:ou42g09.s1 Scares._NFL.J_GBC_S1 Homos5.4 


403612 NA 




Target Exon 


5.3 


407102 AA007629 


Hs.9739 


gIycerol-3-phosphate dehydrogenase 1 (so 


5.3 


410057 R66634 


Hs.268107 


multimerin 


5.3 


428232 BE272452 


Hs.183109 


monoamine oxidase A 


5.3 


432769 AA620814 


Hs.144959 


ESTs 


5.3 


431344 R99530 


Hs.272572 


hemoglobin, alpha 2 


5.3 


427032 AF012023 


Hs.1 73274 


integrin cytoplasmic domain-associated p 


5.3 


406305 BE261320 


Hs.158196 


transcriptional adaptor 3 (ADA3, yeast h 


5.3 


437411 AW613948 


Hs.194915 


ESTs 


5.3 


442800 AI809481 


Hs.131227 


ESTs 


5.3 


402054 NA 




Target Exon 


5.3 


432085 AF212829 


Hs.272406 


potassium channel, subfamily K, member 9 


5.3 


415313 R59638 


Hs.6181 


ESTs 


5.2 


459159 AI904646 




gb:QV-6T065420399-103 BT065 Homo sapien 


5.2 


427164 AB037721 


Hs.173871 


KIAA1300 protein 


5.2 


441391 BE46793Q 


Hs.170381 


ESTs 


5.2 


458959 AI285901 


Hs.181297 


ESTs 


5.2 


402698 NA 




ENSP00000251335*:DJ1003J2.1 (sodium and 


5.2 


401810 NA 




Target Exon 


52 


438879 AA827674 


Hs.189073 


ESTs 


52 


414657 AA424074 


Hs.76780 


protein phosphatase 1, regulatory (inhib 


5.2 


427809 M26380 


Hs.180878 


lipoprotein lipase 


5.1 


456063 NMJJ06744 


Hs.76461 


retinol-binding protein 4, interstitial 


5.1 


451186 AW023469 


Hs.65256 


ESTs, Weakly similar to leuctne-rich gli 


5.1 


451882 AI821324 


Hs.100445 


ESTs 


5.1 


402583 NA 




NM_021620:Homo sapiens PR domain contain 


5.1 


431130 NNL006103 


Hs.2719 


HE4; WFDC2; putative ovarian carcinoma m 


5.1 


458218 AI435179 


Hs.126820 


ESTs 


5.1 


416083 R53467 


Hs.269122 


ESTs, Weakly similar to ALU1 HUMAN ALUS 


5.1 


455282 BE143867 




gb:MR0-HT01 64-0701QQ-013-h02 HT0164 Homo5.1 


426488 X03350 


Hs.4 


alcohol dehydrogenase 1B (class I), beta 


5.1 


426156 BE244537 


Hs.167382 


natriuretic peptide receptor A/guany tale 


5.1 


407891 AA486620 


Hs.41135 


endomudn-2 


5.0 


408610 AW026692 


Hs.224829 


ESTs 


5.0 


445967 D59597 


Hs.1 18821 


CGI-62 protein 


5.0 


434813 AI524307 


Hs.162870 


ESTs 


5.0 


437526 AI076012 


Hs.121388 


ESTs, Weakly similar to MDHC_HUMAN MALAT 5.0 


454775 BE160229 




gb:QV1-HT0413-090200-062-a12 HT0413 Homo 5.0 


409451 AF012626 


Hs.54472 


fragile X mental retardation 2 


5.0 


409853 AW502327 




gbUI-HF-BROp^ka-a-07-O-Ul.rt NIHJMGCJ5 


5.0 


405062 




Target Exon 


5.0 


446490 AK0Q0706 


Hs.15125 


hypothetical protein FU20699 


5.0 


417622 AW298163 


Hs.82318 


WAS protein family, member 3 


5.0 


421978 AJ243662 


Hs.1 10196 


NICE-1 protein 


5.0 


440338 R62431 


Hs.12758 


ESTs 


5.0 


415421 R35009 


Hs.24903 


ESTs 


5.0 


417574 R00348 




gb:ye69e06.rl Soares fetal liver spleen 


5.0 


409882 AJ243191 


Hs.56874 


heat shock 27kD protein family, member 7 


5.0 


447998 AI768289 


Hs.304389 


ESTs 


4.9 


445613 BE550889 


Hs.158491 


ESTs 


4.9 


443074 AW341470 


Hs.144907 


ESTs 


4.9 


451324 AI783500 


Hs.208052 


ESTs 


4.9 


432433 AW014734 


Hs.157969 


ESTs 


4.9 


449654 AI989812 


Hs.199850 


ESTs 


4.9 


414519 N94587 


Hs.55063 


ESTs 


4.9 


457531 AW973716 


Hs.13913 


KIAA1577 protein 


4.9 


433200 AA682722 


Hs.192725 


ESTs 


4.9 


430782 AF026263 


Hs^47920 


cholinergic receptor, muscarinic 5 


4.8 


427555 AW137094 


Hs.97990 


ESTs 


4.8 


433545 AA868510 


Hs.112496 


ESTs 


4.8 


420334 A1349351 


Hs.118944 


hypothetical protein FU22477 


4.8 
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421795 
427138 
413072 
443721 
408053 
427067 



426220 
414593 
426893 
434046 
401590 
457971 
427722 
443793 
407737 
441955 
441499 
447517 
403017 
450580 
404611 
414831 
459290 
444341 
408614 
449638 
434418 
447360 
419583 
440698 
451199 
438338 
433756 
423301 
417237 
439745 
424137 
449338 
434744 
407402 
443510 
415754 
415986 
457416 
418064 
437120 
453950 
401093 
436935 
457974 
428222 
442705 
437409 
458494 
410490 
416069 
438463 
444043 
438327 
457711 
400870 
441425 
416267 
407262 
444567 
403263 



X63094 

N77624 

6E063965 

AW450451 

AW139474 

M843716 

AI025499 

AI383475 

BE386764 

AA398716 

AW292618 

NA 

AW134679 

AK000123 

AA045290 

R49187 

AA972327 

AW298235 

AI382726 

N40087 

H58589 

M31158 

NMJW1546 

A1142027 

AL137698 

AW204277 

AF1 34707 

AI375984 

F00312 

AI348455 

AI290653 

NNI014861 

AW015933 

S67580 4 

H86385 

AL389981 

AA335769 

H73444 

N94835 

AF035303 

NMJJ12190 

AA169114 

Z43619 

BE142052 

BE387287 

AJ356125 

AA156998 

AW206494 

AW842353 

AL133112 

AI264634 

AL359599 

AI360906 

H03589 

R37101 

AA807958 

AI499723 

H87407 

AF147401 

AA933590 

H45384 

M12873 

AV654020 

NA 



Hs.283822 
Hs.173717 

Hs.266355 
Hs.246862 
Hs.177927 
Ks.132238 
Hs.171697 

Hs.97418 
Hs.113011 

Hs.242849 

Hs.180479 

Hs.25930 

Hs.6659 

Hs.142903 

Hs.101689 

Hs.182434 

Hs.15248 

Hs.35156 

Hs.77439 

Hs.34853 

Hs.146650 

Hs.46531 

Hs.250723 

Hs.278679 

Hs.167216 

Hs.147492 

Hs.124758 

Hs.6168 

Hs.1 12654 

Hs.1645 

Hs.81737 

Hs.149219 

Hs.16262 

Hs.394 

Hs.283828 

Hs.9520 
Hs.12247 

Hs.62654 
Hs.83384 
Hs.157767 
Hs.211568 

Hs.253560 
Hs.321717 
Hs.183085 
Hs.131127 
Hs.283850 
Hs.158436 

Hs.20982 
Hs.314232 
Hs.135089 
Hs.172944 
Hs.23917 

Hs.28937 



Hs.184261 



Rhesus blood group, D antigen 4.8 
phosphatide acid phosphatase type 2B 4.8 
gb:QV3-ST0296-140200-085-h01 BT0295 Homo 4.8 
ESTs 4.8 
ESTs 4.8 
ESTs 4.7 
ESTs 4.7 
ESTs, Weakly similar to T1 3924 sdk prote 4.7 
gb:601273249F1 NIH.MGC.20 Homo sapiens c 4.7 
ESTs 4.7 
ESTs 4.7 
Target Exon 4.7 
ESTs 4.7 
hypothetical protein FU201 16 4.6 
ESTs, Weakly similar to 21 09260A B cell 4.6 
ESTs 4.6 
ESTs 4.6 
ESTs 4.6 
ESTs 4.6 
Target Exon 4.6 
ESTs 4.6 
Homo sapiens cDNA FU11027 fis, clone PL 4.6 
protein kinase, cAMP-dependenl regulato 4.6 
inhibitor of DNA binding 4, dominant neg 4.6 
ESTs 4.6 
Homo sapiens mRNA; cDNA DKFZp434C1915 (f 4.6 
hypothetical protein MGC2747 4.6 
a disintegrin and metalloproteinase doma 4.6 
ESTs 4.6 
gb:HSBB0D101 STRATAGENE Human skeletal m4.6 
Homo sapiens cDNA FU1 1777 fis, done HE 4.6 
ESTs 4.6 
KIAA0703 gene product 4.6 
Homo sapiens, clone MGC3764, mRNA, comp 4.5 
cytochrome P450, subfamily IVA, polypept 4.5 
patmitoyf-protein thioesterase 2 4.5 
Homo sapiens mRNA full length insert cON 4.5 
ESTs 4.5 
adrenomedutlin 4.5 
Homo sapiens genomic DNA, chromosome 21q 4.5 
gb:Homo sapiens clone 23943 mRNA sequenc 4.5 
formyltetrahydrofolate dehydrogenase 4.5 
hypothetical protein FU 11413 4.5 
gb:HSC1GE121 normalized Infant brain cDN 4.5 
kringle-containbg transmembrane protein 4.5 
S100 calcium-binding protein, beta (neur 4.4 
ESTs, Weakly similar to HXA2JHUMAN HOMED 4.4 
eukaryotic translation initiation factor 4.4 
C12000586*:gil6330167|dbjlBAA86477.1j (A 4.4 
ESTs 4.4 
ESTs, Weakly similar to S22765 heterogen 4.4 
Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4.4 
ESTs 4.4 
Homo sapiens mRNA; cONA DKFZp547C126 (fr 4.4 

4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 



ESTs 

gb:yj42d08.r1 Soares placenta Nb2HP Homo 

ESTs 

ESTs 

ESTs 

chorionic gonadotropin, beta polypeptide 
ESTs 

C11000905:gil11692565lgblAAG39879.1lAF28 
homeobox protein from AL590526 
gb:yn99c10.r1 Soares adult brain N2b5HB5 
gb:Human Ig rearranged H-chain mRNA VDJ4 
ESTs, Weakly similar to T26686 hypotheti 
Target Exon 
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410034 BE067414 
456804 AI421645 
448427 BE395250 
416931 D45371 
421296 NNL002666 
400973 NA 
452602 AW366194 
412330 NMJ05100 
405016 

435104 AI475671 
406118 

418556 T02850 
429745 AA480818 
433088 AW451206 
444445 AA342329 
453880 AI803166 
447384 A1377221 
414541 BE293116 
444975 AV652165 
403921 NA 
451477 AJ798425 
406344 

416970 AA191201 
413662 BE155866 
458504 AW070634 
404682 NA 
418089 N69913 
403433 

446532 AW975460 
414217 A1309298 
418425 A1871247 
419589 AW973708 
457029 AA397789 
447860 AF193807 
448988 Y09763 
440610 AI733098 
439590 AF086410 
427240 AA399975 
408932 AW594172 
436112 T77545 
444382 A1144152 
456716 AA318060 
419846 NM_015977 
410036 R57171 
400545 NA 
403051 NA 
420139 NM_005357 
450244 AA007534 
453261 AA034116 
440246 W52010 
414516 AI307802 
438232 AI150595 
410233 AA082947 
412179 BE270758 
441871 AI306150 
426411 AK000708 
453692 AL1 10416 
448640 AW817177 
417481 AA203281 
412912 AW118878 
454183 AW807116 
426328 AW631296 
435942 R06285 
417629 T76945 
403593 NA 
402690 

418190 R49591 



gb:MR4-BTt)355-200100-201-e05 BT0355 Homo4.3 
Hs.139851 caveofin2 4.3 
Hs.309438 EST 4.3 
Hs.80485 adipose most abundant gene transcript 1 43 
Hs.103253 perilipin 4.3 
ENSP00000236667*:Mucin 5B (Fragment). 4.3 
Hs.55962 ESTs 4.3 
Hs.788 A kinase (PRKA) anchor protein (gravin) 4.3 
CY000171*:gi|9280405|gblAAF8S402.1|AF245 4.3 
Hs.88607 ESTs, Highly similar to F-box protein FB 4.3 
ENSP00000246632:CONA FU20261 fe ( done 4.3 
gb:FB1 2A9 Fetal brain. Stratagene Homo s 4.3 
Hs.221736 ESTs " 4.3 

Hs.1 15899 ESTs 4.3 
Hs.1 15920 Homo sapiens cDNA: FU22816 fis, clone K 4.3 
Hs.28462 ESTs, Weakly similar to 138022 hypothec 4.3 
Hs.40528 ESTs " 42 

Hs.76392 aldehyde dehydrogenase 1 family, member 4.2 
Hs.182482 ESTs, Weakly similar to T00362 hypothec 4.2 
C5000212^i|10047237[db]lBAB13407.1 j (A U 
Hs.42710 ESTs 4.2 

C5001660:gi|11611537|dbj|BAB18935.1 j (AB 4.2 

Hs.35861 DKFZP586E1621 protein A2 
Hs25522 KIAA1808 protein 4.2 

Hs.144794 ESTs 4.2 

C9001 188*:gi|12738842|ref|NPJ)73725.1| p 42 

Hs.6858 ESTs, Weakly similar to I78885 serine/th 4.2 

NM_001622:Homo sapiens alpha-2-HS-glycop 4.2 

Hs.143563 ESTs 4.2 

Hs.279898 Homo sapiens cDNA: FU23165 fis, done t 4.2 

Hs.6262 hypothetical protein MGC8407 4.2 

Hs.201925 Homo sapiens cONA FU13446 fis, done PL 42 

Hs.161803 ESTs 42 

Hs.131 835 Rhesus blood group, B glycoprotein 4.2 

Hs.22785 gamma-aminobutyric acid (GABA) A recepto 4.2 

Hs.130800 ESTs 4.2 

gb:Homo sapiens full length insert cDNA 4.2 

Hs.274151 ligatin 4.2 

Hs.278513 TP53TG3 protein 4.2 

Hs.1 87559 ESTs 4.2 

Hs.58246 ESTs 4.2 

Hs.135121 hypothetical protein FU22415 4.2 

Hs.285681 Williams-Beuren syndrome chromosome regi 42 

Hs.57975 calsequestrin 2 (cardiac muscle) 4.1 

Target Exon 4.1 

Target Exon 4.1 

Hs.95351 lipase, hormone-sensitive 4.1 

Hs.125062 ESTa 4.1 

Hs.1 18494 ESTs 4.1 

Hs.191379 ESTs 4.1 

Hs.135560 ESTs. Weakly similar to T43458 hypotheti 4.1 

Hs.122226 ESTs 4.1 

gb:zn1 0g07.s1 Stratagene hNT neuron (937 4.1 

Hs.69428 hypothetical protein MGC3020 4.1 

Hs. 153450 ESTs, WeaWy similar to 1909123A Na gfuc 4,1 

Hs.169764 hypothetical protein FU20701 4.1 

gb:DKFZp434KQ431_r1 434 (synonym: htes3) 4.1 

Hs.102558 Homo sapiens, done MGC5352, mRNA, comp 4.1 

Hs.21798 ESTs 41 

Hs.1 10835 ESTs 4.1 
gb:MR4^T0062-0401(XM)24-b12 ST0062 Homo 4.1 

gb:hh83c09.y1 NCLCGAP.GU1 Homo sapiens 4.1 

Hs.191215 ESTs 4.1 

gb:yc92c07.rl Soares Infant brain 1N1BH 4.1 

Target Exon 4.0 

Target Exon 40 

Hs.270425 ESTs 4^ 
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408641 AW245207 Hs.5555 hypothetical protein MGC5347 4.0 

427899 AA829286 Hs.332053 senim amyloid A1 4.0 

445975 AI811536 Hs.145734 ESTs 4,0 

438831 BE263273 Hs.6439 synapsinH 4.0 

455578 BE005350 Hs.14355 Homo sapiens cDNA RJ 13207 fis. clone NT 4.0 

401840 NA Target Exon 4.0 

413753 U17760 Hs.75517 lamlnln, beta 3 (nicein (125kD), kafinln 4.0 

445030 A1205925 Hs.147238 ESTs, Highly similar to AAC3_HUMAN ALPHA 4.0 

433873 AW156913 Hs.150473 ESTs, Weakly simffar to A Chain A. C/yst 4.0 

456736 AW248217 Hs.1619 achaete-scute complex (Drosophila) homol 4.0 

450112 BE047734 Hs.5473 ESTs, Moderately similar to ALU5_HUMAN A 4.0 

448906 AI589567 Hs.309719 ESTs 4.0 
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Table 23A shows the accession numbers for those pkeys lacking unigenelD's for Table 23. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene duster number 
Accession: Genbank accession numbers 



15 

Pkey CAT number Accessions 

409853 1156226J AW5Q2327 AW502488 AW501829 AW502625 AW502687 
20 410034 1170594J BE067414 BE06795B BE067419 BEQ67953 AW577127 AW6G1412 
410233 118656 1 AA082947 AA083036 
410490 1205347 J H03589 AW750687 AW750688 
410882 1225686J AW809163 AW809247 AW809177 AW809190 AW809225 

411478 1247073 1 BE143068AW849143AW848705AW848569AW848071 AW848475AW848092AW848005 
25 413065 1347960J BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 

413072 1348163 1 BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE063869 BE064043 BE064033 BE063884 

414593 1464909 1 BE386764 BE387560 

414913 1506721 1 R25621 C03959 C04010 

415011 151328 1 AW963085AA159005AW963073 
30 415986 1564410 1 Z43619 R6 1274 H 12206 R 12883 

416267 1583547 1 H45384 H49125H41699 

417574 1687770 1 R00348 R09593 

417629 1690392.1 T76945 R20210R05755 

418556 1767866.-1 T02850 
3 5 419583 186198.1 F00312 AA247490 F31427 AA383663 F22045 

426328 264901.1 AW631296 AA375484 

439590 47413 1 AF066410 W94386 W74609 

442398 541271.1 AA994520 AW393574 

452205 90415.1 C15819 AA024741 AA024742 
40 452654 925931J BE004783 BE004947 AI911790 

453692 977825 1 AL110416AW876759 

454183 1049636 1 AW8071 16 AW807569 AW807415 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
BE141569 AW807401 AW807310 BE141565 AW807318 AW8071 19 AW807299 AW807241 AW807225 AW807204 AW807345 
AW807103 BE141615 AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AW807220 AW807399 
45 AW807064 AW807376 AW807024 BE141595 AW807236 AW807027 AW8071 12 AW807378 AW807202 BE141593 AW807216 

AW807138 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW807212 AW807308 
AW8071 10 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW807433 
AW807124 AW807419 AW807031 AW807264 AW807032 AW807029 AW807052 AW807391 AW807207 AW807215 AW807019 
AW807238 AW807201 BE141590 AW807302 AW807323 AW807380 AW807109 BE141588 AW845877 AW807418 AW807407 
50 AW807309 BE 141614 AW845861 AW807396 AW807300 AW807348 AW807311 AW807214 AW807132AW807402 AW807350 

AW807028 AW807298 AW807291 AW807305 AWB07217 AW807312 AW807261 AW807268 AW807260 AW807265 AW807198 
AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807131 
AW800079 AW807066 AW807077 AW845923 AW807421 BE141619 BE140943 AW807420 AW807381 BE141621 AW845921 
BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 
55 454404 1170594J BE067414 BE067958 BE067419 BE067963 AW577127AW601412 

454775 1234106 1 BE160229 AW819879AW820179AW819882AW819876 AW820169BE153201 AW993736BE152911 

455282 1273020J BE143867AW935060AW886684 

459159 919998 J AI904646 BE179494 BE179421 
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TABLE 23B 

Table 23B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos prabeset ■ 

Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. 'Dunham I et al." refers to the publication 

entitled 'The ONA sequence of human chromosome 22." Dunham I. etal., Nature (1999) 402:469-495. 
Strand: Indicates DMA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand Nt_posfo'on 


400545 


9800107 


Minus 


124618-124881 


400870 


9838306 


Minus 


34081-35027 


400973 


7960452 


Minus 


98119-98253 


401093 


8516137 


Minus 


22335-23166 


401590 


9966320 


Minus 


33547-33649 


401665 


7145001 


Plus 


121591-122537 


401810 


7342191 


Plus 


129063-129476 


401840 


7684597 


Plus 


56283-56439 


402054 


8083691 


Minus 


6288-8806 


402195 


7689778 


Minus 


147901-148884 


402583 


7684486 


Plus 


94883-95003 


402690 


8348058 


Plus 


13368-13998 


402698 


8570304 


Minus 


108641-108903 


402779 


9588555 


Minus 


38173-39210 


403017 


6693623 


Plus 


78630-79367 


403051 


4827080 


Minus 


5269-5411 


403263 


7770677 


Plus 


52431-52737 


403433 


9719611 


Minus 


72225-72437 


403593 


6862650 


Minus 


62554-62712.69449-69602 


403612 


8469060 


Minus 


94723-94659 


403921 


7711590 


Minus 


32973536 


404368 


7630956 


Minus 


102053-102199 


404682 


9797231 


Minus 


4097741150 


404689 


7534100 


Plus 


119461-119717 


405016 


6524300 


Plus 


51997-53308 


405062 


7657730 


Plus 


101283-101432 


406118 


9143818 


Plus 


53997-54629 


406344 


9255974 


Plus 


20254-20374,20526-20659,20835-21097 


406563 


7711604 


Plus 


34401-34538 
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TABLE 24: 

Table 24 depicts Seq ID No., UnigenelD, UnigeneTitle, Pkey, Pred.Cell.Loc., and ExAccn 
5 for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No to 
Table 25, 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

Pred.Cefl.Loa: Predicted Cellular Localization 

Seq.ID.No.: Sequence Identification Number found in Table 25 

Pkey ExAccn UnigenelD UnigeneTitle 



Pred.CeB.Loc. Seq. ID. No. 



449746 
407276 
415539 
400297 
450375 
102457 
429170 
424399 
422505 
449765 
425692 
426215 
439840 
410102 
429220 
416276 
409079 
442818 
442082 
444381 
446163 
416636 
442117 
433043 
429353 
452190 
446733 
452747 
423242 
417433 
432201 
423961 
439569 
114480 
404561 
325372 
112287 
335824 
424735 
400289 
427585 
429925 
429441 
421155 
420931 
420813 
452744 



Hs.176588 
Hs.326736 
Hs.72472 
Hs.334473 



AI668594 
AI951118 
AI733881 
AI127076 
AA009647 Hs.8850 
NM_001394Hs.2359 
NM_001394Hs.2359 
AJ905687 Hs.2533 
AL120862 Hs.124165 
N92293 Hs.206832 
090041 Hs.155956 
AW963419 Hs.155223 
AW449211 Hs.105445 
AW248508 Hs^79727 
AW207206 Hs.136319 
U41060 Hs.79136 
W87707 Hs.82065 
AKO01741 Hs.8739 
R41823 Hs.7413 
BE387335 Hs.283713 
AA026880 Hs.25252 
N32536 Hs.42645 
AW664964 Hs.128899 
W57554 Hs.125019 
AL1 17406 Hs.200102 
H26735 Hs.91668 
AA863360 Hs.26040 
BE153855 Hs.61460 
AL039402 Hs.125783 
BE270266 Hs.82128 
AI538613 Hs.298241 
D13666 Hs.136348 
AW602166 Hs.222399 
BE066778 Hs.151678 



NA 

AB033064 Hs.334806 
NA 

U31875 Ks.272499 

X07820 Hs.2258 

031152 Hs.179729 
NMJX)0786Hs.226213 

AJ224172 Hs.204096 

H87879 Hs.102267 

AF044197 Hs.100431 

X51501 Hs.99949 

A1267652 Hs.30504 



ESTs, Weakly similar to CP4 Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-8R 
BMP-R1B 

hypothetical protein DKF2p56401278 
a disintegrin and metaBoproteinase doma 



nuclear 

~r — , r— r ■ nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 
ESTs 

ESTs, Moderately similar to ALU8 J1UMAN A 

N-acetyttransferase 1 {arylamine N-acety 

stanniocalcin 2 

GDNF family receptor alpha 1 

Homo sapiens cDNA FU14035 fis, clone HE 

ESTs 

L1V-1 protein, estrogen regulated 
interteukin 6 signal transducer (gp130, 
hypothetical protein FU10879 
ESTs 

ESTs, Weakly similar to S64054 hypoM 
Homo sapiens cDNA RJ 13603 fis, done PL 
solute carrier family 16 (monocarboxylic 
ESTs 

lymphoid nudear protein (LAF-4) mRNA 
ATP-binding cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs, Weakry similar to fatty add omega 
Ig superfamily receptor LNIR 
DEME^6 protein 

5T4 oncofetal trophoblast glycoprotein 
Transmembrane protease, serine 3 
osteoblast specific factor 2 (fascidin 
CEGP1 protein 

UDP^-acetyUlpha-D-galactosamine.'polyp 
NMJJ141 1 2*:Homo sapiens trichorhinophala mitochodria 
Phase2&3Exons nudear 
KIAA1238 protein 

ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
short-chain alcohol dehydrogenase family 
matrix metaltoprateinase 10 (stromelysln 
collagen, type X, alpha 1 (SchmkJ metaph 
cytochrome P450, 51 (lanosterol 14^a|pha ER 
BpophlBn B (uteroglobin family member) 
lysyl oxidase extracellular 
small indudble cytokine B subfamily (Cy 
proiactin-induced protein nuclear 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
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SeqID 1S2 
SeqID3&4 
Seq/05&6 
Seql07&8 
SeqlD9&10 
Seq ID 11 & 12 
SeqlD11&12 
SeqID 13&14 
Seq ID 15&16 
Seq ID 17 & 18 
SeqID 19&20 
Seq ID 21 & 22 
Seq ID23&24 
Seq 1D25&26 
SeqlD27&28 
Seq ID29&30 
Seq ID 31 & 32 
SeqlD33&34 
SeqlD35&36 
Seq ID 37 & 38 
Seq ID39&40 
Seq ID 4U 42 
Seq ID 43 & 44 
Seq ID 45 & 46 
SeqlD47&48 
SeqlD49&50 
SeqlD51&52 
SeqlD53&54 
Seq ID 55456 
Seq ID 57 & 58 
SeqlD59&60 
SeqlD61&62 
Seq ID 63 & 64 
Seq ID 65 & 66 
SeqlD67&68 
Seq ID 69 & 70 
Seq ID 71 & 72 
Seq ID 73 874 
Seq ID 75476 
SeqlD77&78 
SeqlD79&80 
Seq ID 81 & 82 
Seq ID 83 & 84 
Seq ID 85 & 86 
Seq ID 87888 
Seq ID 89 A 90 
Seq ID 91 & 92 



WO 02/059377 



PCT/US02/02242 



420757 
424905 



10 



15 



20 



446921 
445537 
428227 
424001 
421727 
452838 
419667 
414812 
426320 
422867 
411558 
417866 
428398 
431958 
428722 
412970 
421379 
415752 
444051 
451110 



X78592 Hs.99915 
NMJX)2497Hs.153704 
NH.007050HS.225952 
AB012113 Ha.16530 
AJ245671 Hs.12844 
AA321649 Hs.2248 
W67883 Hs.137476 
Y13153 Hs.107318 
U65011 Hs.30743 
AU077005 Hs.92208 
X72755 Hs77367 
W47595 Hs.169300 
L32137 Hs.1584 
AA102670 Hs70725 
AW067903 Hs.82772 
AI249368 Hs.98558 
X63629 Hs.2877 
U76456 Hs.190787 
AB026436 Hs.177534 
Y15221 Hs.103982 
BE314524 Hs78776 
N48373 Hs.10247 
AI955040 Hs.265398 



androgen reoeptor (dihydrotestosterone r 
NIMA (never In mitosis gene a)-related k 
protein tyrosine phosphatase, receptor ( 
small inducible cytokine subfamily A (Cy 
EGF-like-domaln, multiple 6 
smaQ inducible cytokine subfamily B (Cy 
paternally expressed 10 
kynurenine 3-monooxygenase (kynurenine 3 
preferentially expressed antigen in mefca 
a disintegrin and metaDoproteinase doma 
monokine induced by gamma interferon 
transforming growth factor, beta 2 
cartilage oligomeric matrix protein (pse 
garnma-aminobutyric acid (GABA) A recepto 
collagen, type XI, alpha 1 
ESTs 

cadherin 3, type 1, P-cadherin (placenta 
tissue inhibitor of metalloprotelnase 4 
dual specificity phosphatase 10 
small Inducible cytokine subfamily B (Cy 
putative transmembrane protein 
activated leucocyte cell adhesion molecu 
ESTs, Weakly similar to transformation-r 



cytoplasm Seq ID 93 & 94 

nuclear Seq ID 95 & 96 

Seq ID 97 & 98 

extracellular Seq ID 99 & 100 
Seq ID 101 & 102 

extracellular Seq ID 103 & 104 . 

Seq ID 105 & 106 
Seq ID 107 & 108 

nuclear Seq ID 109 & 110 

Seq ID 1118. 112 

extracellular Seq ID 113 & 114 

extracellular Seq ID 115 & 116 

extracellular Seq ID 117 & 118 
Seq ID 119 & 120 
Seq ID 121 & 122 
Seq ID 123 & 124 

plasma membraneSeq ID 125 & 126 
Seq ID 127 & 128 
Seq ID 129 & 130 

extracellular Seq ID 131 & 132 
Seq ID 133 & 134 
Seq ID 135 & 136 
Seq ID 137 & 138 
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TABLE 24A 



5 

10 
15 



Table 24A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

335824 CH22J197FGJ19J1JJNICE 
325372 c12Jis 
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TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

1 0 Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham J. et al." refers to the publication 

entitled 'The DNA sequence of human chromosome 22.' Dunham I. et al., Nature (1 999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 

15 

Pkey Ref Strand NLposition 
404561 9795980 Minus 69039-70100 

20 
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Table 25 

The 69 gene sequences identified to be overexpresscd in breast cancer may be used to identify coding regions from die public DNA databases (nr and htgs in 
Genbanlc). The sequences may be used to either identify genes that encode known proteins, or they may be used to predict the coding regions from genomic 
DNA using exon prediction algorithms, such as FGENESH (Salamov and Solovyev, 2000, Genome Res. 10:516-522) 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
! I I I I I 

ATGGAGCCCT CCTGGCTTCA GCAACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGG ATGATC 1 20 
AGAGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 1 80 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGGACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCCAAG 300 
ATTCTCCTGA AAAGACAAGA TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TGAATCCTGG 360 
GTTGGTCGAG GACTTGTGAC CCTGGATGGT TCTAAATGGA AAAAGCACCO CCAG ATTGTG 420 
AAACCTGGCT TCAACATCAG CATTCTG AAA ATATTCATCA CCATGATGTC TGAGAGTGTT 480 
CGGATG ATGC TGAACAAATG GGAGGAACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTTT 540 
CAACATGTCT CCCTGATGAC CCTGGACAGC ATCATG AAGT GTGCCTTCAG CCACCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AGAGAAAGTA 780 
ATCCAGGACC GGAAGGAGTC TCTTAAGGAT AAGCTAAAAC AAG ATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGGACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGATTTCTCT 900 
GAAGCAGATC TCCAGGCTGA AGTGAAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 
AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAG ATGC 1020 
CGAGATGAAA TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAG ATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 1 140 
AACATATCCC GGTTACTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACCTGCA 1200 
GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGGAAGAC 1260 
CCTCAGGTCT TTAACCCCTT GAGATTCTCC AGGGAA AATT CTG AAAAAAT ACATCCCTAT 1320 
GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AGAATGG AAT CCATGTGTTT 1500 
GCAAAAAAAG TTTGCTAATT TTAAGTCCTT TCGTATAAGA ATTAATG AGA CAATTTTCCT 1560 
ACCAAAGGAA GAACAAAAGG ATAAATATAA TACAAAATAT ATGTATATGG TTGTTTGACA 1620 
AATTATATAA CTTAGG ATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1 680 
TCTTTGCTGT ATCTGGTG AA ACCCACAAAA ACACCTG AAA AAACTCAAGC TGACTTCCAC 1 740 
TGCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATG ATAGCT 1 860 
TTATTCTCAG TTATCTTTCC CCAATAATAA AAAA 



Seq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 II 21 31 41 51 
I I I I I I 

MEPSWLQELM AHPFLLLILL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 60 
YPVKEFEVYH KLMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKILESW 120 
VGRGLVTLDG SKWKKHRQIV KPGFNISUiC IFITMMSESV RMMLNKWEER IAQNSRLELF 1 80 
QHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
SQGQIFSKFN QELHQFTEKV IQDRKESLKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 
EADLQAEVKT FMFAGHDTTS SA1SWILYCL AKYPEHQQRC RDEIRELLGD GSSJTWEHLS 360 
QMPYTTMCIK ECLRLYAPW NISRLLDKPI TFPDGRSLPA G1TVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHPY AFIPFSAGLR NCIGQHFAII ECKVAVALTL LRFKLAPDHS 480 
RPPQPVRQVV LKSKNG1HVF AKKVC 

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NMJ)52997 

Coding sequence: 1 00-4 125 (underlined sequences correspond to start and stop codons) 



I 11 21 31 41 51 
I I I I I I 

CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAG A TG ACAAAGAG GAAGAAGACC 120 
ATCAACCTTA ATATACAAGA CGCCCAG AAG AGGACTGCTC TACACTGGGC CTGTGTCAAT 1 80 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA GACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCG AACACA GGACACCTCT GATGAAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CGATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CATTATGCTG TTTATAGTG A GATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCG AAG TGCACAACA A GGCTAGCCTC ACACCACTTT TACTATCCAT AACG AAAAGA 480 
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AGTGAGCAAA TTGTOGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGG AT CATCAGAGAT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTGA CGTCTTTGCT GCAG ATATAT GTGGAGTAAC TGCAGAACAT 660 
TATGCTGTTA CTTGTGGATT TCATCACATT CATGAACAAA TTATGGAATA TATACGAAAA 720 
TTATCTAAAA ATCATCAAAA TACCAATCCA GAAGG AACAT CTGCAGG AAC ACCTGATOAG 780 
GCTGCACCCT TGGCGGAAAG AACACCTGAC ACAGCTGAAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGAACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1020 
CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC J 080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGGAAGA TCGCATGGGA GAAAAAAGAA 1200 
ACACCTGTAA AGACTGGATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
AAAGGAAGAT CTAAG ATGAT TGCATGTCCT ACAAAAGAAT CATCTACAAA AGCAAGTGCC 1 320 
AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGGAGTC TCTTTG AGAG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAG AATGAA 1 560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 
AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 
AAGGCTACAC ATCAAAAAGA AATAG ATAAA ATAAATGGAA AATTAGAAGA GTCTCCTAAT 1740 
AAAGATGGTC TTCTGAAGGC TACCTGCGG A ATGAAAGTTT CTATTCCAAC TAAAGCCTTA 1 800 
GAATTGAAGG ACATGCAAAC TTTCA AAGCG GAGCCTCCGG GGAAGCCATC TGCCTTCGAG 1 860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATGAACAA 1920 
ACATGGAGAG CAGATGAGAT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAG AGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAA AGAAAT AGATA AAATA AATGGAAAAT TAGAAGGGTC TCCTGTTAA A 2 1 00 
GATGGTCTTC TGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 
TTGATGGACA TGCAAACTTT CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 
GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTGAAGAA TGAACAAACA 2280 
TTG AGAGCAG ATGAGATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAGAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAG A TAAAATAAAT GGAAAATTAG AAGAOTCTCC TGATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAGAATGAAA GTTTCTATTC CAACTAAAGC CTTAGAATTG 2520 
ATGGACATGC AAACTTTCAA AGCAGAGCCT CCCGAGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCAAATAAA GCCTTGG AAT TG AAGAATGA ACAAACATTG 2640 
AG AGCAGATC AGATGTTCCC TTCAGAATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGG 2700 
GATTCTGAGA GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 
ATCTTGGATA CAGTTCATTC TTGTG AAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG GAAAAATGGA ACAAATGAAA AAG AAGTTTT GTGTACTG AA AAAGAAACTG 2940 
TCAGAAGCAA AAGAAATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TGAGATTGAC TTTAAACCAA GAAGAAGAGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 
GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAGAATT GAAGAGTGTA 3180 
GAAAGTAATT TGAATCAGGT TTCTCACACT CATG AAAATG AAAATTATCT CTTACAJGAA 3240 
AATTGCATGT TGAAAAAGGA AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAG 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AGAAAAGAAT 3360 
GCTGAACTTC AGATGACCCT AAAACTGAAA GAGGAATCAT TAACTAAAAG GGCATCTCAA 3420 
TATAGTGGGC AGCTTAAAGT TCTG ATAGCT GAGAACACAA TGCTCACTTC TAAATTGAAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAG AGGCA GAAATTGAAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATG A TCAAATTGTG ACATCAAGAA AAAGTCAAG A ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATG TTGATGTGAG TAGTACGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTGAAGCTC AAAGGAAATC CAAAAGCCTA 3720 
AAAATTAATC TCAATTATGC AGGAG ATGCT CTAAG AGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAAACACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 3900 
TTTCA ACT AC AAAGCAAAAA TATGTGGCTT CAACAGCAAT TAGTTCATGC ACATAAGAAA 3960 
GCTGACAACA AAAGCAAG AT AACAATTGAT ATTCATTTTC TTGAGAGGAA AATGCAACAT 4020 
CATCTCCTAA AAGAGAAAAA TGAGG AGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CATG AG AG AC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TGAAAATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATG A TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTG AA 4320 
GAATTACTTG TTCACGAATT GCATAAAGCT GCACAGGATT CCCATCTACC CTGATGATGC 4380 
AGCAGACATC ATTCAATCCA ACCAGAATCT CGCTCTGCAC TCCAGCCTAG GTGACAGAGT 4440 
GAGACTCCAC CTCGG AAA 



Scq ID NO: 4 protein sequence: 
Protein Accession «: NP_443723. 1 

1 U 21 31 41 51 
I 1 I 1.1 I 

MTKRKJOTNL NIQDAQKRTA LHWACVNGHE EVVTFLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACANILI DSGAD[NLVD VYGNMALHYA VYSEILSWA KLLSHGAVIE VHNKASLTPL 120 
LLSITKRSEQ IVEFLLIKNA NAKAVNKYKC TALMLAVCHG SSE1VGMLLQ QNVDVFAAD1 1 80 
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CGVTAEHYAV TCGFHHIHEQ IMEYIRKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRKIAWE KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
IAWEKKETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
DEEYSCDSRS LFESSAKIQV CEPESIYQKV MEWREVEEP PKXPSAFKPA IEMQNSVPNK 480 
AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKING 540 
KLEESPMKDG LLK.ATCGMKV SIPTKALEUC DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKNEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKEIDKINGK 660 
LEGSPVKDGL LKANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 
ELKNEQTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK DVCLPKATHQ KEIDKINGKL 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK. PSAFEPAIEM QKSVPNKALE 840 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE SLRETVSQKD VCVPKATHQK EMDKISGKLE 900 
DSTSLSKJLD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KECCSQLENQ 960 
KVKWEQELCS VRLTLNQEEE KRRNAD1LNE KIREELGRE EQHRKELEVK QQLEQALRIQ 1020 
DIELKSVESN LNQVSHTHEN ENYLLHENCM LKKEIAMLKL EIATLKHQYQ EKENKYFEDI 1080 
KHJCEKNAEL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSK1XEKQ DKEILEAEIE 1 140 
SHHPRLASAV QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTTYNN EVLHQPLSEA 1200 
QRKSKSLK1N LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSK1T1DIHF LERKMQHHLL KEKNEEIFNY 1320 
NNHLKNRIYQ YEKEKAETEN S 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession H: none found 

Coding sequence: 273-1785 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
I I I I I I 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGG AAGATCA TTTCATGCCT TGTTG ATAAA GGTTCAGACT TCTGCTGATT 1 80 
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGG AAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTrCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 
GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 
TATAAAAG AC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 
ATTCCTCCTG GAG AATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTG AAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1 140 
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTG A AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 
CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 
AATCACTTCC AGTCTT ACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1 500 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCA AACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGG AAAACTC 1 680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGG AG AAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



Sea IP NO: 6 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 
I I I I I I 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 
DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESURDLI 180 
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV K.VFFTTEEAS 240 
WFRETETYQT VLMRHENILG FIAADIKGTG SWTQLYUTD YHENGSLYDY LKSTTLDAKS 300 
MLKLA YSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KXNGTCCIAD LGLAVKFJSD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGULWEV ARRCVSGG1V 420 
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EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSPPNRWSSD ECLRQMGJCLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 482-3007(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 
TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 1 80 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTA GTAT GCA 240 
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 
CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAG ATA ACGTCAGGA A GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 
CCAAACTCCA GTGCTCTCAT CCAG AGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTG AAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CG ATGCTTCA 720 
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 
TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 
TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 
CCTGCAAGCA G ATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCG ATTTGTT CCTTTAACCC ATCTAGATCT TCGTGG AAAT CAATTACAAA CATTGCCTTA 1 080 
TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1 140 
GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 
ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 
ATCAGOACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCT GAGACC 1560 
TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 
GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 
TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 
ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CA ATGCCT AA 1860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 
CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 
TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGG A GAAGATACAA 2400 
AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TG AAAGACCC TCTGCCTCAC TCTATGAACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAG AGAGG A ATGAG AAAG A AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 
TCTTTTGGA A CAGG AAAATC ATTCACCACT CACAGGGTCA AATATG AAAT ACAAAACCAC 2700 
GAACCAATCA ACAGAATTTT TATCCTTCCA AG ATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 
TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTG ACTAT TTAGAAGTCC TGG AGCAGCA 3000 
AACATAGATG GAGAGTTTG A GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 



Scq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMUNCEA KGQCMVSEIS 60 
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNT ADIE1GAFNG LGLLKQLHIN 120 
HNSLEILKED TFHG LENLEF LQADNNF1TV 1EPS AFSKLN RLK.VLILNDN A1ESLPPN1F 1 80 
RFVPLTHLDL RGNQLQTLPY VGFLEH1GRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 
EGDWCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 
TSIUCLPTKA PGUPY1TKP STQLPGPYCP IPCNOCVLSP SGLLCHCQER NTESLSDLRP 360 
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15 



PPQNPRKUL AGNIIHSLMK SDLVEYFTLE MLHLGNNRJE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKGM FLGLHNLEYL YLEYNA1KEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 
GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKXSKNTV 540 
TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 
LTDAVPLSVL ILGLUMFIT IVFCAAGIW LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660 
YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 
LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 

Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: NM_003474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons) 



1 11 i\ 3! 41 51 
I I I I I I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 

o A TCAAGGC TGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 

ZD CTTTTTTAAA AATGAAAGGC TAGAAG AGCT CAGCGGCGGC GCGGGCCGTG CGCG AGGGCT 1 80 
CCGGAGCTG A CTCGCCGAGG CAGGAAATCC CTCCGGTCGC G ACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTG A GCTTATGGAA CG AAGGAAG A 420 

ID GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGG AAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTG A TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGG ACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 

5 U TCTGGTCTCA GGGG ACTT AT TGTGTTTG AA AATGAA AGCT ATGTCTTAG A ACCAATG AAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGG ATCA 840 
TGTGG ATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGG AGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 

J J CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1 140 
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

4U GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1 620 

4 J GACTGTGGGG AGCCAGAGG A ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1 680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC I860 
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 

J U CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAG AGATGCT 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCG ATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 

5 5 GCAGATGG AA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAG ACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 

6U TTTACAAATA AG AAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
CCGCCAGATT CCTACCCACC G AAGGACAAT CCCAGGAG AT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTG AAT GTCCCTCAGC CCCAGTCAAC TCAGCG AGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAG ACC CCTGCCAGCC 2820 

65 AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAG AT CCACCCACAC CGCCTATATT AAGJGAGAAG CCGACACCTT TTTTCAACAG 3060 
TGAAG ACAG A AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3 120 

7 0 GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTG AGCTAC TGCCGTCAGT 3 1 80 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTA ATTTAT 3240 
GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGG ATG TGTTTGCTTT CTGATCAAGG CCTTATTGG A AAGCAGTCCC 3420 

75 CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTG AT TTTCTGG ATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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ACACCTGGCA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTG ACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCA AGGC 3780 
GGTGCTTG AT AG AAATGOCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3 840 
5 CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

1 0 TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

1 5 AATGGC ATG A GAAATACAAA AATACTCAG A TAAGGTAAAA TGCCATG ATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA GAAG ACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

20 AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 

25 A CCA AAA AAA AAAAAAAAAA AA 



Seq ID NO: 10 Protein sequence: 
Protein Accession*: NP_003465.2 



1 11 21 31 41 51 
I I I I I I 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIPVKSFD 60 
SKNHPEVLNI RLQRESK-ELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 

3 5 YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 1 80 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
. DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

40 SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGIjCC EDCQLKPAGT 480 
ACRDSSNSCD LPEFCTOASP HCPANVYLHD GHSCQDVDG Y CYNGICQTHE QOCVTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NCGK.VSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKJCLNRQCQ NISVFGVHEC 660 
. _ AMQCHGRG VC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVT1LCL 720 

45 LAAGFWYLK RKTLIRLLFT NKKTTIEICLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 

50 Scq ID NO: 1 1 DNA sequence 

Nucleic Acid Accession U; NM_00 1 394 

Coding sequence: 400-1584(underltned sequences correspond to start and stop codons) 
1 M 21 31 41 51 

55 i i i ii i 

GGCACGAGGG CTG AGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAG AGT CCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 180 

r . TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCGAG GCTGCGGGAG GAAGGGGACA 240 

60 GGGAAGAAGA GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTTC 360 
TCTTCCTCGC GCGCCCAGCC GCCTCGGTTC CCGGCGACCATGGTGACGAT GGAGGAGCTG 420 
CGGGAGATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGGACGAGAA TGGCGGCGGC 480 
QCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 

6 5 CTGG ACTGCA G ACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600 
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAGATCCTG 660 
CCCGCCGAGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACGAGC GCAGCCCGCG CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCG AGCGC ACCG ACATCT GCCTGCTCAA AGGCGGCTAT 840 

70 GAGAGGTTTT CCTCCGAGTA CCCAG AATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
ACCCCACTAC ACGACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GG ACACTATC AGTACAAGTG CATCCCAGTG 1 1 40 

7 5 GAAG ATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260 
GCCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TG AGGCTGGA GG AGGCCTTC 1320 
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GAGTTCGTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1380 
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTGC TAGCCCCTCG 1440 
GG ACCCCTGC GGG AGCGGGG CAAG ACCCCC GCCACCCCCA CCTCGCAGTT CX3TCTTCAGC 1 500 
TTTCCGGTCT CCGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
ATCACCACCT CTCCCAGCTG TTAG AGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1 620 
CAGCAAGGGT AGGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CGAGGCCAGC CAGAATGGCA 1740 
ATAAGGACTC CGAATACATA ATAAAAGCAA ACAG AACACT CCAACTTAGA GCAATAACCG 1800 
GTGCCGCAGC AGCCAGGG AA G ACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1 860 
AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAGATC 1 920 
CTAGCAAATG TGCCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAG 1980 
AAGACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTTGT 2040 
AGTTAAAGGA ATTTCATTTT TAAAAGAAAT CTTCGAAGGT GTGGTTTTCA TTTCTCAGTC 2100 
ACCAACAGAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 2160 
TGAAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC ACTGAGAAGC AGGCGGGTTG 2340 
GGTGGGAGGA GGAAGAAAGG GAAGAATTAG GTTTGAATTG CTTTTTAAAA AAAAAAGAAA 2400 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460 
TGGG AGG AGG AAGAAAGGGA AG AATTAGGT TTG AATTGCT TTTTTAAAAA AAAA 

Seq ID NO: 12 Protein seouence: 
Protcia Accession #: NP_001385 

I 11 21 31 41 51 
I I I I I 1 

MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60 
ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVTVYDE RSPRAESLRE 1 20 
DSTVSLWQA LRRNAERTDI CLLKGGYERF SSEYPEFCSK TKALAAIPPP VPPSATEPLD 1 80 
LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240 
QYKCIPVEDN HKADISSWFM EA1EYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300 
VRLEEAFEFV KQRRSIISPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 68-340(undcrlincd sequences concspond to start and stop codons) 



1 II 21 31 41 51 
I ! I I 1 I 

AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 
TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 
TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCG ACCA CTGCTGCTCC 240 
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
GCGAGCTAAC AT 



Seq ID NO: 14 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 
I I 1 I I I 

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 
ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 

Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: NM_0 1 6640.2 

Coding sequence: 39-1 358(underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 
I I ! I I I 

GCTTAAGTTG ACCTCTGGGT CCG GAAT CGC GGGCAAAG AT GGCGGCGGCC AGGTGTTGGA 60 
GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA G ACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCG ATTG 180 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGGATCGAG CGCTGGCAGG 240 
CGACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATGAAGTA CATGGTTTAC CCGCAG ACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCO 420 
AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540 
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AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCG A TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAG AAAT CCCCACTATA AA ATGTAAAC 840 
CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 
GGGAAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 
AAGCAG ATGT TACTCG ACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1 1 40 
CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 
CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTG AGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 
TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AACCTGTTAT TAG ATCTCTT 1500 
ACTCTGCTCA AATTCATCAC TGAAAG ATTT AATTTTAGTT ACCTTTTGTT GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTG ATAA G ACAAATTG A GTTATTG AGC TATTAAATGC 1 620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 
AAAAGAACCC ACTGC 



Seq ID NO: 16 Protein sequence: 
Protein Accession U: NP_057724. 1 

1 11 21 31 41 51 
I I I I I I 

MAAARCWRPL LRG PRLSLHT AANA A ATATE TTSQD V A ATP V ARYPP1V AS MTADSKA ARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120 
PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 180 
VSTLVG LLSP HNPALAAAAL DYRCPVHFYW VRGEEHPRG HRRGRIDDLR YQIDDKPNNQ 240 
IRISKQLAEF VPLDYSVPIE IPTLKCKPDK LPLFKRQYEN HIFVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI EWFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQA V 360 
ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVLLQ 420 
1VHFLLNRPK EEKSQLLEN 

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: NM_025059.1 

Coding sequence: 3-2 1 50 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
I I i I I I 

GCATGAGCCT GGACTGCACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60 
AAACTTACGA TCATCTTTCG G AAGTCCCGG TCACGCGGG A GCAGTTAA AC C ACTATCGGA 1 20 
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 1 80 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAG CTG A AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300 
CTTTGAGAGA CAGAGTTCAG G AACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGTAG AGTT AAATG AAAAA TTACAAAAGT GTTCAAAAG A AAATGAGGAG AATAAG AAAC 480 
AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTOCGT GACTGCTTGG 540 
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATG AATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660 
TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCTGGCT TCAGAAGTCA 720 
ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAGAA AGAGAAGCTG AACCAGGACC 780 
TGCTCAGTGC TGTAG AAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCA AAGCA GGAAGTGAGC CTCCTGAAGA 900 
AAAGCTCTTC TGAGTTGGAG AAGAGTTTGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960 
AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020 
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG AAGCCCAA AT ATCTGAGCTT GTTGAACAGT 1 140 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTG ACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 
TTTTGCGAGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG GATCAGCTTT 1320 
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC CCACAATTTG CAGAGAAAGC TAAAGACACA GAAAGAGAG A CTGGAGAGCA 1500 
AAGAATTACA CATG AGCCTC CTCCGGCAG A AAATAGCCCA GCTGGAGGAG G AG AAGCAGG 1 560 
CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTC AAGTCA GAACTGGATA CCACAGAAC A TGAGGCTAAG G AG AATAAAG 1 860 
AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAA ATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGG AGGTG GTGTCGCAGA 1980 
TGCTAGGCTT G AACGTGACC AGCCTTGCTC TTCCTGATTA TGAAATCATC AAGTGTCTTG 2040 
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AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100 
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATTG A ACACTGTATC 2160 
TCTTGAGAGA GQTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTGATCAGTT TGTGAATATT TTATGCTTTG ATGATATAGT G AGAATGCAT 2280 
J CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAG ATAA ACGTCAGCAT TAAAAAACGC 2340 
CAAAAAAAAA AAAAA AAAGC ATTTTAGG AT CCAGAAGAAT TCCACCAGAT TGCATGAGTT 2400 
AGATTGGGAA ATGGGAGTGG G AG ATAATAT TGGG AGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAG AAG CTGTGACATT TTTAATTTAC 2520 
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 
1 U GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AGAATCTGTT TCTCCTGAAT CTCTGTGATG CTGGTGGGAA TTGTTTGCAT AG AGG AAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

1 J GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTG AG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTC AG AAAAA AAAAAAAAAA AAAAA AAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATG ATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGGATCA 3 120 
AACTTACATA GGTCTCAGGT CCTGTAAGAA ACTTGCCTGT TCTAACTGTT GCTACCAGAT 31 80 

ZU TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240 
AGCTTTATTT GGAGCCCAAC CCTATAAGAT G AAGAAATCC TATATAGTCT TATTCACCAA 3300 
TATATCCAAT ACACCCACAG CAATGGTACC TTTTTAAGAT CAGGATTTTA TTATGAATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420 
CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGG ACAA TTTATCTTAT TTCTATCATA 3480 

ZD TAAGATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540 
AAGAAATAGA TCTGGTGCAC CCGAACATTA GGAGAAAATG AAAAATATAC AACCAACCGT 3600 
TCGTG AGTCA TCAAAAAGTC AAAGTC AGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660 
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 
GCTGAGACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CGAGATCGCG 3780 

30 CTACTGCACT CCAGCCTGGG CAACAGAGAG AGACCTTGTC TCAAAAAACA ACAACAACAA 3840 
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACY AA AAGAAATTAA 3900 
AG AGATGAAA CAATCAATTG CAAGGGTCAA AACTAGATTG GATCTTGGTT TGAATG AAAA 3960 
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT 4020 
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080 

3 J AACTAAAAAT CTGAGAATCC ACAGTGCTAC AGACAATAAA TGATAAAATG GGAAAAAAAA 41 40 
AAAAAAAAAA A 

Sea ID NO: 18 Protein sequence: 
40 Protein Accession #: NPJJ79335.1 

I It 21 31 41 51 
I 1 I I I I 

MSLDCTSHIA LGAASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
45 SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAAESTSKI 120 
RTE1TAHAAJ KENQELKKXV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 1 80 
PDERNDKASD EDLILKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TIMRLASEVN 240 
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTILE KIREMDSREE 360 
5 0 SRDRMVSQLE AQISELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420 
LRDNLNFEKQ KYLKFLDQLS QKMKLDQMAA ELGFDTRLDV VLARTEQLVR LESNAVIENK. 480 
TIAHNLQRKL KTQKERLESK ELHMSLLRQK IAQLEEEKQA RTALWERDN AHLTTRNLQK 540 
KVERLQKELN TCRDLHTELK AKLADTNELK [KTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600 
LMSVK.SELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREVVSQM 660 
5 5 LGLNVTSLAL PDYEIIKCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH 

Seq ID NO: 1 9 DNA sequence 

Nucleic Acid Accession #: AF071552, NM_000662 

60 Coding sequence: 441-13 13 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
A c I I I 1 I I 

OD CTTTGTATAA GGCTCAGCTA AAAGGG AAAT TGAGTGGGTC AGGTACCACG GATACTATAC 60 
ACTCTATTGC ATGATTCTCC TGCCTACATC AGAAGACGTT TATAAGCCTA TTTTAAAGGA 120 
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAG AGAAC CATGAACAAG CTGTTTATCA 1 80 
TTTGACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAGAAG ACATAATACA 240 
TTTCTCACAG GATTCTGGGA CTATTAACTG AACTTATGTG TGTAAAAGGA ATTCATACAA 300 

70 TG AAAGC ACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC TTTCGTTTTG 420 
TTTTCCTTGC TTAGGGG ATC ATGGACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTGACAT TCTTCAACAC CAGATCCGAG 540 
CTGTTCCCTT TGAG AACCTT AACATCCATT GTGGGG ATGC CATGGACTTA GGCTTAGAGG 600 

75 CCATTTTTGA TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTGACC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720 
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATG ATTCA CCTTCTCCTG CAGGTGACCA 780 
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TTCATGGCAG GAACTACATT GTCGATGCTG GGTTTGGACG CTCATACCAG ATGTGGCAGC 840 
CTCTGGAGTT AATTTCTGGO AAGGATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960 
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
TTAAGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080 
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CCCAGATGGG GTTCACTGTT 1 140 
TGGTGGGCTT CACCCTCACC CATAGGAGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTATT TAGAATAAGG 1320 
AGTAAAACAA TCTTGTCTAT TTGTCATOCA GCTCACCAGT TATCAACTGA CGACCTATCA 1380 
TGTATCTTCT GTACCCTT AC CTTATTTTGA AG AAAATCCT AG ACATCAAA TCATTTCACC 1 440 
TATAAAAATG TCATCATATA TAATTAAACA GCTTTTTAAA GAAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 
TGGAAATTGG TGATTTATGC TAGAAAGCTT TTAATGTTGG TTTATTGTTG AATTC 

Seq ID NO: 20 Protein sequence: 
Protein Accession »: NP_000653. 1 



1 11 21 31 41 51 
111)11 

MD1EA YLERi G YKKSRNKLD LETLTDILQH QIRAVPFENL NIHCGDAMDL G LEAIFDQVV 60 
RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNY1 120 
VDAGFGRSYQ MWQPLEUSG KDQPQVPCVF RLTEENGFWY LDQIRREQYI PNEEFLHSDL 180 
LEDSKYRKIY SFTLKPRTIE DFESMNTYLQ TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240 
HRRFNYKDNT DLIEFKTLSE EEIEKVLKNI FNISLQRKLV PKHGDRFFT1 

Scq ID NO: 21 DNA sequence 

Nucleic Acid Accession #: NMJJ03714 

Coding sequence : 1 23- 1 03 1 (under! ined sequences correspond to start and stop codons) 



I 11 21 31 41 51 
\ I I I I I 

CGGCACGAGC AAAAAGGAAG AGTGGGAGGA GGAGGGGAAG CGGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120 
CCATGTGTGC CGAGCGGCTG GGCCAGTTCA TGACCCTGGC TTTGGTGTTG GCCACCTTTG 180 
ACCCGGCGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAG AC AGGAGCTCCC 240 
AGCAG AAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAG AT CCAGCACTGT TTGGTCAACG 300 
CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAGACGCC TTGAAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540 
ACCTCAAGCA CGACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGGAG ATGA 600 
TCCATTTCAA GGACTTGCTG CTGCACGAAC CCTACGTGGA CCTCGTGAAC TTGCTGCTGA 660 
CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTGGGGAAG CCTGTGCTCC ATCTTGAGCT TCTGCACCTC GGCCATCCAG AAGCCTCCCA 780 
CGGCGCCCCC CGAGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900 
GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TATTCTGATA 1020 
TCCGGAGGTQ_AAATG AAAGG CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC ATTCCAAAAC ATTTACCATT AGAGAGGGGG GATGTCACAC GCAGGATTCT 1 140 
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGGAGAC 1200 
CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1 260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGGAG TCAGTGGGTG 1 320 
TCGGCCGCTC TGTTGTGGGG GAGGTGAACC AGGOAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 
AAATATCGCT TAGAATGCAG GAGAAGGGTG GAGAGGAGGC AGGGGCCG AG GGGGTGCTTG 1 560 
GTGCCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAGAGCT CTTGGCGAGG 1620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTGTACTG TCTGGGCTGG 1 680 
GGGGGACACT GTCCAAGGOA GTGGCCCCTA TOAGTTTATA TTTTAACCAC TGCTTCAAAT 1740 
CTCG ATTTCA CTTTTTTTAT TTATCCAGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1 800 
GCTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Scq ID NO: 22 Protein sequence: 
Protein Accession #: NP_0037O5 



1 U 21 31 41 51 
I I I I I I 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVPEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFDCDALKCK AHALRHRFGC 120 
ISRKCPAIRE MVSQLQREC Y LKHDLCAAAQ ENTRVTVEMI HFKDLLLHEP YVDLVNLLLT 1 80 
CGEEVKEAIT HSVQVQCEQN WG SLCS1LSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 

356 





WO 02/059377 



PCT/US02/02242 



RR 



15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: NM 005264. 1 

Coding sequence: 557- I954(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
I I I I I I 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAAAT CGTCCTGGCC 120 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 1 80 
TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 
CTG AAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTG AGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGGAGC TGAGT CGCC G GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GCGGGGACCA 540 
GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTG AAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 
GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACCACCAGCG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1 140 
CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 
AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 
TGCAAATTTA CAGGCACAG A AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC I860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 2040 
GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2100 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 
A AAGCA AACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 
TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 

Scq ID NO: 24 Protein sequence: 
Protein Accession #: NPJW525 5 

1 II 21 31 41 51 



MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCUCEQ SCSTKYRTLR QCVAGKETNF 60 
SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD IFRWPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYITPCTT 180 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 
YIDSS SLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360 
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL WTALSTLLS LTETS 

Seq ID NO: 25 DNA sequence 

Nucldc Acid Accession M: FGENESH predicted 

Coding sequence: 1576(entirc sequence represents open reading frame) 

I 11 21 31 41 51 



CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 
TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGG AGC CCCCCGCCCC 1 80 
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CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 
GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CG AGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC G ACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 
GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1 140 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
G AACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1 260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 
GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 
CATCTGCG AG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATC ACAGCCA CGCAAGCCAT 1 560 
CCGAATATTC TCCTAA 

Seq ID NO: 26 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 
I I I I I I 

FCFAMPSLVV SGDMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE 1VGRQGCKIK ALRAKTNTYI KTPVRGEEPV 1 80 
FMVTGRREDV ATARREHSA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRWG 240 
LWGPKG ATI KRIQQQTNTY HTPSRDRDP VFEITG APGN VERAREEIET HIA VRTGK1L 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGC1GECG VDSGFEAPRL 360 
GEQGGDFGYG GYLFPGYGVG K.QDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480 
VTAALVPCGH NLFCMECAVR 1CERTDPECP VCHITATQAI RIFS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession U: FGENESH predicted 

Coding sequence: 1 -2070 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
t I I I I I 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 
CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 
CAGCAGCAGC ACTCGG AGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 
CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 
ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG G AGGACATCT GACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 
GACATGGAG A AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAG ATGG COCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1 260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 
AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 
GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCG A AGTGCTCATC CGCG AGCTGT GG AATACCAA CCTCCTGCAG 1 800 
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ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1 860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Seq ID NO: 28 Protein sequence: 

Protein Accession if; FGENESH predicted 



1 11 21 31 41 51 
I I I I I I 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 
GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 1 80 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQ1AAVARPR ISSPMALSPH 240 
MLGAQG IWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK. ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG A VPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
GARWVCTNGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPG VQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMGAG AHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ED NO: 29 DMA sequence 

Nucleic Acid Accession #: NM_0 123 192 

Coding sequence: 1 38-2405 (underlined sequences correspond to start and stop codons) 



! 11 21 31 41 51 
I I I I I I 

CTCGTGCCGA ATTCGGCACG AG ACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 
GOGGAGACGA AGGCGCAATG GCGAGG AAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 1 80 
TCTCTCTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAG AAAATAA TTCTTTGTCA GTTGAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 
ACCACGACCA TCACTCAGAC CACG AGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATG ATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CG AAAAGCTC TTTGCCCAGA CCATGACTCA G ATAGTTCAG 600 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 
TCTCTGAAGG AACTCACTTT CTAGAGACAA TAG AGACTCC AAGACCTGGA AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 
TGGCTGGTAG GAAAACAAAT G AATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCAT CCAGGTTCCG CTG AATGCAA CAG AGTTCAA CTATCTCTGT CCAGCCATCA 1020 
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 
CTCCAAAG AC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1 1 40 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1 200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT G ATGCTTTTT 1 260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 
CAATGG AAAT G AAA AG AGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1 380 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 
AG AAG AAACC TGAA AATG AT G ATG ATGTGG AG ATTAAG AA GCAGTTGTCC AAGTATGAAT 1 560 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 
GAGCAGACTC ACAAG AGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1 680 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1 800 
TTCACCACCA TCATG ACTAC CATCATATTC TCCATCATCA CCACCACCA A AACCACCATC 1 860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 
GTGCTGCTTT TACTG AAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATG AATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGGTCATTAT G CTG AAA ATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 
GCTTATTCAT GTATGTTGCT CTGGTTG ATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 
TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 
TATTCTATCT TGG AGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAA ATGT CTTTAATGCT 2700 
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TTTTCAAGAA CTAACACAGT TATTCCTATA CTGG ATTTTA GGTCTCTOAA GAACTGCTGG 2760 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAGAAA TAAACXJAGAA AAGAGAAGAA TCTGAGA ATT GGGGAGGCAT AG ATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAG AG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTG TCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTO TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3 120 
TTCGTGCGGG TTATATACCA G ATG AGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3 1 BO 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

Scq ID NO: 30 Protein sequence: 
Protein Accession #: NP 03645 1 .2 

1 I! 21 31 41 51 
I I I I I I 

MARKLSVLLI LTFALSVTNP LHELKAAAFP QTTSOSPNW ESGDNVDLAI STRQYHLQQL 60 
FYRYGENNSL SVEGFRKLLQ NIGIDKDCRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKG AHRPE HASGRRNVKD 1 80 
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 
RSCUHTSEK KAEIPPKTYS LQIAWVGGFI AiSHSFLSL LGVILVPLMN RVFFKFLLSF 360 
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTLQCQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 
MGDGLHNFSD G LAIGAAFTE G LSSGLSTSV A VFCHELPHE LGDF AVLUCA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML USIFEHKIV FRINF 

Scq IO NO: 31 DNA sequence 

Nucleic Acid Accession U: NM_002 184.1 

Coding sequence: 256-301 2(underiined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
11)111 

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 
TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 
AAATATCCGC GCAAOATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 
CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 
GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATC AT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 
TATGG AATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 
GTGAACGAGG GGAAGAAAAT GAGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 
CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GG AGCCAG AT TCCTCCTGAA 1 080 
G ACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1 1 40 
GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 
GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 
TATCTAGCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 
ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAG AG A GCAAATGCTA TTTG ATAACA 1 740 
GTTACTCCAG TATATGCTG A TGG ACCAGG A AGCCCTG AAT CCATAAAGGC ATACCTTAAA 1 800 
CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACA AAAA AAGTAGGGAA AAACGAAGCT 1860 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 
GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 
GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GGAGTGCTGT TCTGCTTTAA TAAGCG AGAC CTAATTAA AA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 
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TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCG AGC 2520 
ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAG AAG AT 2640 
CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAG AACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATG A GGAAGATTTT GTTAG ACTTA AACAGCAGAT TTCAG ATCAT 2820 
A TTTCA CAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAGT_GA AGG ACT AG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTA AAATGAT TTTATCTGTG AATTC 

Scq ID NO: 32 Protein sequence: 
Proicin Accession it: NP_002 1 75. 1 

I 11 21 31 41 51 
I I I i I I 

MLTLQTWVVQ ALFIFLTTES TGELLDPCGY ISPESP WQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEW ATHKf A DCKAKRDTPT 1 80 
SCTVDYSTW FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240 
KLTWTNPSIK SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFR1R 300 
CMKEDGKGYW SDWSEEASG1 TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYUTVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 
QLPVDVQNGF 1RNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IVVPVCLAFL LTTLLGVLFC FNKRDLIKKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSVVEIEAND KJCPFPEDLKS LDLFKKEKJN 720 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVO HVDGGDGILP RQQYFKQMCS QHESSPDISH FERSKQVSSV 840 
NEEDFVRLKQ Q1SDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Scq ID NO: 33 DNA sequence 

Nucleic Acid Accession NM J) 18255.1 

Coding sequence: 1 1 -2491 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
I I I I I I 

AGTTGGCGAC ATGGTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60 
GGTGCGGGGA GTCCTGAACT GGAGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTG GTGCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 1 80 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTAAACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GG AGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTTC AAGGCCATGA AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGG AAATGGA TTTGCTTTGG CTCTCTGCTT ATC T n ITTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGAAAGTG CTTTCTCTCT GTGGACATGA GGATTGGATT AGAGGAGTGG AATGGGCAGC 660 
CTTTGGTAGA GATCTTTTCC TAGCAAGCTG TTCACAAG AT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAG AAAC TCAGGATG AC GATAACATAA GACTG AAAGA 780 
AAATACTTTT ACCATAGAAA ATG AAAGTGT TAAAATAGCA TTTGCTGTTA CTCTGG AGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AG ATGGTGTC CTACAGCAGC CAGTGAG ATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 
TCTCTGGGCT CCAGATGAAG AGTCAGG AGT TTGGCTAG AA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT G AAGATGGCT OCATG ATCAT 1 080 
TGCTCATGCT TTCCACGGAG CGTTGCACCT TTGG AAACAG AATACAGTTA ACCCAAGAGA 1 140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTG AAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1 380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTTC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGGAT TATCAAATAA AGCTGTCTTT CAGGG AGATA TAGCTTCTCA 1 560 
GCCTTCTGAT G A AG AGG AGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGG ATCAT CTTCTGCAGA ATACTTTGTG 1 680 
GCCTGAAGTT CAAAA ACTAT ATGGGCACGG TTATG AAATA TTTTGTGTTA CTTGTAACAG 1 740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GGAAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1860 
CACGCAGATG GCCTTCTCAC CTAATGAG AA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGG ATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
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GAGTCCTOAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT CCTCAGTCCT 2 1 60 
GGACGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGG AT TGGAGTGTGG AAAGATTTGC TTATATACCT GG AAAAAGAC 2280 
TGATCAAGTT CCAGAAATAA ATGACTGG AC CCACTGTGTA GAAACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAGAA 2400 
GG AAGCAGAA GGTGCTG AGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTG AA 2460 
GATACACAGA GTCAATAAAT GTGCACTG TA AT GG 

Scq ID NO: 34 Protein sequence: 
Protein Accession U: NP_060725. 1 

I 11 21 31 41 51 

I I I I I I 

MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPUCRVV VTNLNGHTAR 60 
VNCIQW1CK.Q DGSPSTELVS GGSDNQVIHW ElEDNQLLKA VHLQGHEGPV YAVHAVYQRR 120 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 1 80 
GNDDCRIMF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQO CURIWKLY1 240 
KSTSLETQDD DNIRLKENTF TIENESVK1A FAVTLETVLA GHENWVNAVH WQPVFYKDGV 300 
LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMT1AHA 360 
FHG AtHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFIITVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE IARPQIHGYD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYG HGYEI FCVTCNSSKT LLASACKAAJC KEHAAHLWN 600 
TTSWKQVQNL VFHSLTVTQM AFSPNEKFLL A VSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNKITSVHSR 1IWSCDWSPD SfCYFFTGSRD KKVWWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHPSQRYVV AVGLECGKIC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AIRKLCWKNC SGKTEQKEAE GAEWLHFASC GEDHTVKIHR VNKCAL 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession U: NM_022 1 3 1 

Coding sequence: 1 1 -2878 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I 1 I 

TGCTGCGAGG ATGCTGCCTG G GCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGG ACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 1 20 
CAATAAGCAC AAGCCATGG A TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 1 80 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 
AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCA AG A TCTATG ACAG 540 
CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GG ACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 
GAAATCCCTT CAGAAGTTAT GTGG AGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1 020 
TAGCGCTGCC ACCAACTGG A CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TG ATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1 140 
TCAGTTCACC ATCACCATGT GGATG AAACA CGGCCCCAGC CCTGGTGTG A GAGCCG AG AA 1 200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGG AAGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1 380 
TGTCATCAAT GTGG AGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 
CGGCGCTTGT TGGCAAGG AG GAG AAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1 560 
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTG ATG G AAGGTGACG ACATTGGGA A 1 740 
CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 
TGTGCGGCGC CTCAA AGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1 860 
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGOAGTGAC 1980 
CCTCTTCCCT G AT ATCAAG A TTGTGAGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 
G AAAACCACA GACCCCAA AT CAG A AGTCTT AG AGG AAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 21 60 
CAGTGAGCTC CACCAACGAC ACCTGG ATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 
TAAGGAGCAT GTCA ATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CGCCCACCAG CACTTCATCC AGGAG ACTG A GGCTGCCAAG G AATCTGAGA TGG ACTGGGA 2640 
CG ATTCTGCG CTG ACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGG A GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACTAGTG 2880 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 
AAG AAGCCCA GCATGGCCAT CAGTGAGG AC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3 1 20 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 
TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGC AGGGT CTGATCCCCC ATCAG AGCTA TCTG AGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 
CCAGG AGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 
AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCA ATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAG AA TTGTTTCTAT AGTAACTGGT 4200 
CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 
ACGAAGCTCT GAGAACATGT TTGTTTCG A A TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAGA AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 
CTTGTGAAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGGACCT 4620 
CACAACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTCG 

Seq ID NO: 36 Protein sequence: 
Protein Accession #: NPJ)714 14.1 

1 11 21 31 41 51 

i I I I I I 

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPW1ETSYHG VITENNDTV1 60 
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAWLNKT SGEGRLRAKS PIDCELQKEY 120 
TFIIQAYDCG AGPHETAWKK SHKAVVHIQV KDVNEFAPTF KEPA YKAVVT EGKJYDSILQ 1 80 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK IS YDKQHQYE ILVTA YDCGQ 240 
KPAAQDTLVQ VDYKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQrVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
RQGAKIPDGI VPKNLTDQFT 1TMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPWTLYM DG ATYEPYLV 480 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLURPGKME SQKVISCLQA 540 
CKEGLDINSL ESLGQGKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
LKVSSKVQCF GEDVCISIPE VDA YVMVLQA IEPRITLRGT DHFWRPAAQF ESARGVTLFP 660 
DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720 
KQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 
NEFNLEVSIL HEDQVSDKEH VNHUVQPPF LQSVHHPESR SSIQHSSWP SIATWIHS 840 
VCMLVFWAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTTTVNPMEK. HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1 43-874 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGG GCCCC GCCGCCTCCC CGCAGCGGCT 180 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGG AG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTG AGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
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AGCTATAATT TATTTGGACC AAGG AAGCCC TGA AATGA AT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGG AG ATGCT TCTACTGGAT GGAATTCAGT 840 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1 140 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1 200 
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTT AAA AAA AAAAAA A AAAA 

Seq ID NO: 38 Protein sequence: 
Protein Accession U\ none found 

1 11 21 31 41 5t 
111)11 

MRPQGPAASP QRLRG LLLLL LLQLPAPSS A SEIPKGKQKA QLRQREV VDL YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAJIYLDQ 1 80 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIW VGTCSD YPKGDASTG W NSVSR1IIEE 240 
LPK 

Seq ID NO: 39 DNA sequence 

Nucleic Acid Accession #: NM_000949 

Coding sequence: 285-2 1 53(underlined sequences correspond to start and stop codous) 
1 11 21 31 41 SI 

GGAGGCTGAA ATCCCCAG AC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCCTCTTT CTGG ATTTTA CCGACCGTTC GCG AAACAGC TTTCCACACA ATGG AGCTTC 1 20 
ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTT GCTCC CTGACAAAAC 1 80 
TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 
CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAA ACCTG AG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGG ACA G ATGG AGG AC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGG AAGG AG A G AC ACTC ATG C ATG AATGTC C AG ACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GG AGGACATA CATCATGATG GTCAATGCCA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGG AG CTGGCTGTGG AAGTAAAACA GCCAGAAG AC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 
TCCTGTATGA AATTCGATTA AAACCCG AG A AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAG AAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 
TACCTAGTG A CTTCACCATG AATG ATACA A CCGTGTGG AT CTCTGTGGCT GTCCTTTCTG 1 020 
CTGTCATCTG TTTG ATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1 080 
GCATCTTTCC GCCAGTTCCT GGGCCAAA AA TAAAAGG ATT TGATGCTCAT CTGTTGGAGA 1 1 40 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 
ACTATGAGG A CTTGCTGGTG GAGTATTTAG AAGTAG ATGA TAGTGAGGAC CAGCATCTAA 1 260 
TGTCAGTCCA TTCA AAAGAA CACCCAAGTC AAGGTATG AA ACCC ACATAC CTGG ATCCTG 1 320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 
CTG AAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1 500 
TTCATGCTGG TGG ATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1 560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTG AAT GAAGCAGGTA AAG ATGCTTT AAAATCCTCT CAAACCATTA 1 680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTG ACCAGG A TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1 800 
CCTTGG ATTA TGTGGAG ATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1 860 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAG AATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GG ATTACCTG G ATCCCGC AT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2 1 60 
ACTAATGG AA TG ATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA G AGTACGTGA 2220 
AATG CTC AAG AATGTAGTCA GACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCC AGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTG ATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 
AAAAGCACAC TGCTTAGTAT TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATGA TTTGGCATGG GACAG ACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAG ATG AC A A AAG AAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 
AAAGAA AG AA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 



Seq ID NO: 40 Protein sequence: 
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Protein Accession tt: NP 000940. 1 

1 11 21 31 41 51 

5 MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 

SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTUDUC TGWFTLLYEI RLKPEKAAEW 180 
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVW1S 240 
VAVLSAVICL HVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGK.SE ELLSALGCQD 300 

1 0 FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEK.CEEPQAN PSTFYDPE VI EKPENPETTH TWDPQCISME GK1PYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTDCSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS G VMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 

1 5 KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession #: none found, Eos cloned sequence 

Coding sequence: 1 -1 572 (underlined sequences correspond to start and stop codons) 

20 

1 U 21 31 41 51 

ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTGA AGTGCCTGAT 60 
GGAGG ATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 1 20 
25 ATCATCAAGA CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TGAATCCAAT 180 
AGCAGGATCT CATGGATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGGATACTGC TTTAGTTTTC TCCCAACTGT AACCATCCTA 420 
3 0 TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTG A AGGAGCGCAT TGGCTGG AGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCGAAAATAG TCATCCAGGA AAATCGGAAA 660 
GAAGCGCAGT ATATGCTTG A AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
35 GG AGTAG AAC TAACTACCTC ACCTA AAAAT GTGCCTACTC ACACTAACCT GG AACTGG AG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAG ACTTCTC CATTTTGAAA GAGAAAAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAGAA 1020 
40 GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080 
ATTG AGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAGAGC ATAGCAGGAC TGGCTGGACC GCCCCTTGCA 1320 
45 GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTG AAGACT TTCTGG AAAT GGATCTTGCA AAAAATG AGC ACAG AGTTCA CGTGCAAATG 1 560 
GAGCCGGTATGA 



50 



Seq ID NO: 42 Protein sequence: 

Protein Accession U: none found, Eos doncd sequence 



1 11 21 31 41 51 

55 i i i j i I 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 
SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLVVMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGIISGLG YC FSFLPTVTIL SQYFGKRRSI VTA VASTGEC FAVFAFAPAI MALKERIGWR 1 80 
YSLLFVGLLQ LNTVIFG ALL RPIFIRGPAS PKIVIQENRK EAQYMLENFJC TRTSIDSIDS 240 
60 GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360 
IEUCVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DWGIEKMSS 420 
AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKW SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 
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Scq ID NO: 43 DNA s 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

70 1 11 21 31 41 51 

ATGCTGTCTG G CTTCTTG AT G AGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 
75 GGGGAGGCGG AG AAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGG ACTGCGG CTGGGGGCGC GCAGG ACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
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TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAOA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG G ACAGTCAGT GACGAGGCCC CGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCG ACAAT GACCGCCAGC AGGGCGCTGG CG AGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGOTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TT(XCGGGGC TGCTGCCCCG TGCCAGACGC 1 140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAAC AT GCCCACG AAT TTCTCTGTGC TG AACTGTCA GCAGGCCACC 1 560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 
GCTTCATTTG CCCCCAG ACT GTATG AAAAC ATCTCCGAAT TAGCATTTCT GG ATATGTTT 1 920 
CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 
TTTTTTTTTT TTI 1 li 1 1 AA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGG AT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAAAAG ATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 . 
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 
TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 
TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Scq ID NO: 44 Protein sequence: 

Protein Accession 0: FGENESH predicted 

1 11 21 31 41 51 
] I I i 1 I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRGE PPAW1RAQQQ PRPPPAGQAP GTAAGG AQDP RLRPGRSRGR VRLPVKPPEA 120 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLG VAAEGSG 1 80 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQG VWRI GFQCPERFIX3 GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGIXPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAF1ILGS 420 
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPM1PSASTS RGSSSRQSST AASSSSSANS 480 
GARAPPTRSQ TNCCLPEGTM NNVYVNMPTK FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

Scq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_002285 

Coding sequence: 55-3738 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 SI 
I I I I 1 I 

G $™ AGCCT CGGCGGGG GC GGTAGCGGCG GCGGCGACGC TGACACCTCC CACCATGGAC 60 
C ^ GC JI C 1 GAC J TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 

^t^F- AA ^J G CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180 
^ GC ££I JTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
S^iiSTP CCAACCGGAT CCAG AACACT TTAGGCAATT ATG ATGAAAT GAAAGACTTT 300 
TTAACTGATA G AACCAATCA GAGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360 
1 0 ^^?^^ C ^ GA i CGA TGAACATTTT CTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 
IU TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA G AAGGCTGGG CACCCACCCT CTG ACGGCCA ACAGAG AGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600 
C^^r T G ^I? T GGAGG ™ GGG CTTCAGACCC AGGAGAGGCC ACctSTg TsO 
GCGGCCAAGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 
^%5^ GCC TGGTCCAG CA G AAACCGACC GCGTATGTG A GGCCAATGGA CGGCCAAGAT 780 
CAGGCCCCTG ATGAGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
^S^S^a 1^??™ G ° GGGAGGAG AQTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
20 AAArTrrAAr Ja^SS ^ GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 
U AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1080 
CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1 140 
AGA ^ CAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GGAGCTGAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
?<> l GC rt^F CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 
AGCTCCTCCA GCGACTCAGA GAGCAGCTCC GGATCTGACTCGGAGACCGA GAGCAGCTCC 1380 
AGCGAGAGTG AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCG AGGCTG A ACCGGCATCC 1440 
J C J™ AAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA ATGAAAGCCACGGGTCAGAG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1560 
30 ATTAATArrA r^rrrr a a"^^ AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAGAAGGAG 1620 
A I^™ A CT^AAGGA GGAGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAA ^ CG ™ A AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GCGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
-X S S A ™S C ™ AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 
JD 2^^^ t ttJf CAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 
TCCTCCGTGA CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
^r^r^Irr 1^^^ G !S GTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 
G ° AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 
AC\ ?2 ^™ TC AGAGGCTGAA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 
4U GTAGGCTCCA TCAACGCCAG GACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280 
TACACACTGG TCCCCTTTGG CCGGAACGAA CTTCTCTCCC CTCTA AAGG A CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCG ACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
^S AG rP^S A 5 GGG1ATT GAGCGCCCCT GCCACCAAGG ACTCTGAGAG CGCACCGCCC 2460 
AS AG ^ CACCT CGGACAG ACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 
I5T? AGAAG 5 AAGACG ACTA CAGGGAG ATC AAGAAGTCCC AGGGAGAGAA AG ACAGCTCT 2580 
] CAAG J£l GG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAAAAAT GAAAAAATGC TTCGGTCGCC CATCTCACCC 2700 
^?£J GATG GATCTAAACA CAAATACACC AGCG AGGACT TAACTTCTTC CAGCCG ACCT 2760 
SO ^nn?n A ^ ?™J TTAC '"•GAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 
^^f C TS AC S GG 5 GAGA CCTCACGAAA GCAGCTCACA ACAATTCTGA AAACATTCCC 2880 
™ GAG ^ G I ACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAA ^ C J A ^ AC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
S 5 ^5I A T GC I G AAGCAGCATT GTCGTTTATC GAGTGTGG AA ATGCAATGGA ACAAGGCCCC 3 120 
AATCTCCTTA TTACCTG ATG TATTCAG AAA CAGTAGAGCT CATCAGGTAT 3180 
5ST A J GAGAC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAG ACAA ACAACTGGCT 3240 
GCA ™: TGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
60 ^ GC ^ A JS C ^ G Z5 GGG GCGGAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 
OU CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAG ATGGC GGCCAACCAC 3540 
551 a^Ya ^^^^ G ^Z OTGCACAGC TACGACTACT GGGAG ATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGAATT CTTCAACGAC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 
65 AG ™ C ^ CC ™J CAmLGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG ScAGA mo 

Saa^t^ 



SeqIDNO: 46 Protein sequence: 
/U Protein Accession #: NPJW2276 

1 11 21 31 41 
I I I J 



MDSFDLALLQ EWDLESLCVY EPDRNALRKK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
75 KGDELSNRJQ NTLGNYDEMK DFLTDRTNQS HLVGVP^GV PQTPVNKIDE HFVADSRAON U0 

5LVQQ] 
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QDQAPDESPK UCSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300 
CVEEIIREMT WLPPLSA1QA PGKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAWQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
PILIQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEHCSTCKEE QRPRTANKAP 540 
GSKG VKQKSP PA AVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVW PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRIVP 660 
KSKEFIETES SSSSSSSDSD LESEQEEYFL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720 
AP VGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DE1RSLWVKI DLTLLSRIPE 780 
HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EJKKSQGEKD 840 
SSSRLATSTS NTLSANHCNM NINSVAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960 
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETVELI RYAMRUCTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 
RDHAVKYSKA LIDYFKNSSK. AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1140 
SALSPSTIVS IPQRIHQMAA NHVSITNSIL HSYDYWEM AD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Scq ID NO: 47 DNA sequence 

Nucleic Acid Accession ff: NM_033 15 1 

Coding sequence: 35 1 -4499(underlined sequences correspond to start and stop codons) 



I 11 21 31 41 51 

i i I ! I I 

ACTGGG ATAA AGCAAGAAGA CTG ATTTTAT GAGCAGGGGT TTGATACATC AAAGG AGATT 60 
GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTGAT TTGGGCCTG A GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 1 80 
TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTC ATAT TCTCTG ATTC TGGG AAACGA AGAATTGGCA GGAACTGAAA ATGACTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGG A CTTATTTATA AAACCTATAC TCTCCAAG AT GGCCCCTGGA 480 
GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTG AGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGG AGT CGCTTAGATG AG AACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCG ATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GACTCTGCTT TGCCCITI I I' CTCTCCG AAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCG AGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1 140 
TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGG A TACACTGCAT 1 260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTG AAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1 380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGG A GAAGTGCGGG CTTGTCCAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CT CACG AATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1 740 
TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTG AGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GG AAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1 980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAGACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 
TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT TAAGAAGAC A CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTG ACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGG AAA ATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAG AAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAG AAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
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CAGGGGACTT GGAACAGCTG G ACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3 1 80 
TGTCCTTAAT GGTGATCGCC GTCCTGTTG A TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG G AGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
ATTCTCTGCA AGGCCTG AGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTGAT G CGCAG A AT A ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 
AGTTCACGGC TGTAGAGAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
TGGACATTTG CAGCATCGGC CTGGAGG ACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 
CCACAGCCTC CATTGACATG GAG ACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 
GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGG AGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 
TC 

Seq ID NO: 48 Protein sequence: 
Protein Accession U: NP J 49 i 63.2 

NTTRKRTWVPNSSGGLVNRGIDIGDDMVSGUYKTYTLQDGPW 

MIPFRPKPRFPAPQPLDN AG LFSYLTVS WLTPLMIQ SUlSRLDEhn"IPPI^VHDASDKNVQRLHRLWEEE 

VSRRGIEKASVLLVMLRfQRTlU.IFDALLGICFCIASVLGPILnPKlLEYSEEQLGNVVHGVGLCFALF 

I^ECVKSL^FSSSWITNQRTAIRFRAAVSSFAFEKUQFKS^ 

VLITCASLVICSISSYFIIGYTAFIAILCYLLVIT>LAVFMTRMAVICAQHHTSEVSDQRIRVTSEVLTCIK 

L1KMYT W EKPFAjaiEDLRRKERKLLEKCGLVC^LTSTTLFIIPTV ATA VW VUHTS LKLKLTASM AFSM 

LASLNLLRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPVFYVQTLQDPSKALVFEEATLSWQQTCPGI 

VNGALELERNGHASTOMTRPRI)Al^PEEEGNSLGPEUiKINLVVSKGMMLGVCGNTGSGK5SLLSAILEE 

MHLUGSVGVQGS1AYWQ^AWIVSGNIRE^^LMGGAYDKARYLQ^1.HCGSL^^LELLPFGDMTEIGER 

GLNL^GQKQRISLARAVYSDRQIYLLDDPLSAVDAHVGKlflFEECIKKTLRGKTVVLVTHQLQYLEFCG 

QIILLENG K1CENGTHS ELMQKKGKY AQLIQKM HKEATSDMIXJDTAKIAEKPKVESQ ALATSUiESLN N 

AWEHQLTQEEEMEEGSLSWRVY1WIQAAGGYMVSCIIFFTVVUWLTIFSFWWLSYWLEQGSGTNSS 

RESNGTMADLGNIADNPQLSFYQLVYG LNALLLIC VG VCSSG EFTKVTRKASTALHNKLFNKVFRCPMSF 

FDTIPIGR1XNCTAGDLEQIJ3QLLJ1FSEQFLVLSI^VIAVLU 

KKAIGVFKRLEhTYSRSPLFSHIU^SLQGLSSIHW 

ETMTr^VTLAVALFVAFGISSTPYSFKVMAVMVUJLASSFQAT^ 

EAPLHMEGTSCPQG WPQHG ElffQD YHMK YRBNTPTVLHGINLTTRGHEVVG IVGRTGSGKSSLGMALFR 
LVEPMAGRIUDGVDICSlGLEDLR^KLS\ftPQDP\l,LSGTIRFNLDPFDRHTDO^ 
SKFPKKLHTDWENGGr^SVGERQLI^IARAVLRNSKIIU^ 
VIAHRVTTVLNCDHILVMGNGKVVEEDRPEVLRKKPGSLFAALMATATSSLR 

Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: NMJJ33419 

Coding sequence: 1 8-9 80 (underlined sequences correspond to start and stop codons) 
1 U 21 31 41 5t 

I I ! 1 I I 

CGAGCCAGGG AGAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCGAAGA GCAGAACTGC TCTGGGGGOG CTCTG AATCA CTTCCGCTCC CGCCAGCCAA 1 80 
TCTACATGAG TCTAGCAGGC TGG ACCTGTC GGGACG ACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTA CCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 
TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 
AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGG TCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGG AAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 



369 



WO 02/059377 



ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGGA CATG AAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1 140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCC A GCATCTGGG A CTCGAGAGTG 1 200 
GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1 320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
ACCAAGCTGC TGGG ATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1 560 
GGAGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 
GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1 980 
AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 
TCGGAGGCAG AGGAAGG AAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTG A GGGAGAAGGG GAGAAAGCTT 2280 
ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 
AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 
CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 
TCCCTCTTTG AGGG AGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TTTTG ATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 

Scq ID NO: 50 Protein sequence: 
Protein Accession U: NP_219487.1 

1 II 21 31 41 51 
I I I I I I 

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGK£f HPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 180 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 



Seq ID NO: 51 DNA sequence 

Nucleic Acid Accession #: XM J059O98. 1 

Coding sequence: 178-61 8(underiined sequences correspond to start and stop codons) 

1 II 21 31 41 51 
I I I I I I 

GATGTACACT CTGAAGTG AG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 
GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTGAGATG 1 80 
TCGTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTG A TTCCTGCAGT CCCOTCCATT 240 
TCCAGAG ATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 
GTCTTTGACC CCTTG AGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 
TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480 
AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTGAAG 600 
AAACTCTCTG AATGTTAOAT CTCAGGGTAC AATGATTAAA CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 
GAGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCATTTCTAG GTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AGACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AG AATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1020 
TGCCCTGACC AATCCTACTG CTTTTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080 
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTGAT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence: 
Prote in Access ion U: XP J>59098 . 1 

1 11 21 31 41 51 
t I I I i 1 

MSYTTMC1KE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITWLSIWGL HHNPAVWKNP 60 
KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN C1GQEFAMIE UCVTIALILL HFRVTPDPTR 1 20 
PLTFPNHFIL KPKNGMYLHL KKLSEC 
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Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession ft: NM 0309 1 6 

Coding sequence: 1-1533 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
III!)! 

ATGCCCCTGT CCCTGGCAGC CGAGATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA CGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 1 80 
CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTG ACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGACC AAAGGATCAC CCACATCCTC 720 
CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAGACCAAAA TCTGTGGCAC 780 
ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTG AAG GGCAGCCCCC TCCCTCATAC 840 
AA CTGGA CAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG 1080 
TTCTGCCTTC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1 140 
CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGG AGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
GAGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
GAGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGG AG A TAGAAACACA GACTGAACTG 1380 
CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

Seq ID NO: 54 Protein sequence: 
Protein Accession #: NP J 1 2 1 78. 1 

1 II 21 31 41 51 
I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGE LETS DW TVVLGQDAKL PCFYRGDSGE 60 
QVGQVAWARV DAGEGAQELA LLHSICYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRJTHIL 240 
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 360 
FCLLVWWL MSRYHRRKAQ QMTQK.YEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGKQ 480 
AMNHFVQENG TLRAKPTGNG rYINGRGHLV 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: AF007 1 70. 1 

Coding sequence: 73-1 725 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 1 80 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAG ATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AAC ATGGTG A GCTTC ATCAA AGGCGGC ATC AAAGTTCGAA ACAGCTAOC A G ACCTAC AAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 
AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TG AAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGG AGA AAGGCCCAGA GAACGAGTAC 1380 
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TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATG ACCACT ACTTG ATCCC AAACGCCCTG CTGG AGCTGG CCCTGCTGCT TATGGAGCA A 1 S60 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGG AGT CAAGG ACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1 680 
CTAGAGAACA GCAGCAG ATC CATGGTCTCA TCAGTCTCCT TGIAGCTTTG TGCAGCAGTT 1 740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG I860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAG AGCAG GTGG AGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1 980 
GTGATGTTTA AGAG AATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2 100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2 160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2140 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCGAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAG G AG G A GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGG AATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

Seq ID NO: 56 protein sequence: 
Protein Accession AAC39582. 1 

1 11 21 31 41 51 
I I I 1 I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENM VS 120 
FIKGGiKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRJLR 1 80 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVKGNTDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYA DLLS KEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILE1 420 
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKK1KYDHY 480 
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession »: NM_006670. 1 

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
I I I I I I 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 1 20 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAOCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGG A AACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGG AA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTG A CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGG ATGTCCT CGAGTGA 

Seq ID NO: 58 Protein seauence 
Protein Accession ft: NPJJ06661 . 1 



1 11 21 31 41 51 
I I I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 1 80 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SYVFLGIVLA LIGAIFLLVL YLNRKGUCKW MHN1RDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 



Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession it: NM_024022 
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Coding sequence: l-1362(uuderiined sequences correspond to start and stop codons) 

1 II 21 31 41 51 
I I I I I I 

ATGGGGGAAA ATGATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCGATC G CTTTTT GGC 60 
CTTGATGATT TGAAAATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTGA A GTTTTTTCCA ATCATCGTCA TTGGGATCAT TGCATTGATA 1 80 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCGACTGCT CAGGGAAGTA CAGATGTCGC 240 
TCATCCTTTA AGTGTATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTGCAAAGAC 300 
GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTGTTC 360 
ACAGCTGCTT CGTGG AAGAC CATGTGCTCC GATGACTGGA AGGGTCACTA CGCAAATGTT 420 
GCCTGTGCCC AACTGGGTTT CCCAAGCTAT GTGAGTTCAG ATAACCTCAG AGTGAGCTCG 480 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540 
GTG ACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG CCACGTGGTT 600 
ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTGTTTAT 780 
GACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
CCAGCCCCAT CCCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA GCCAAAGAGG 900 
CTGGGCAATG ACATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960 
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGG AAAAGT GTGCTGGACG 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
GTCCCTTTGA TTTCCAACAA G ATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 1 40 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGG CGTGG ACAGCTGCCA GGGGGACAGC 1200 
GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1 260 
GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTAC A CCCGTGTCAC CTCCTTCCTG 1 320 
GACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAACC T GA 

Seo ID NO: 60 Protein sequence 
Protein Accession #: NPJ)76927 

1 11 21 31 41 51 
I I I I 1 I 

MGENDPPAVE APFSFRSLFG UDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 1 80 
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRJVGG NMSLLSQWPW QASLQFQGYH 240 
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360 
VPL1SNKICN HRDVYGGI1S PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 
G1GCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 61 DNA sequence 

Nucleic Acid Accession NM_006475 

Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
I I i I I I 

AACAG AACTG CAACGGAG AG ACTCAAGATG ATT<XCTTTT TACCCATGTT TTCTCTACTA 60 
TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120 
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 1 80 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACG ACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATG AAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAG A 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 
ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTG ACCG TGTGCTTACA 720 
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACG AG GTGTCCTAGA AAGGTTCATG 900 
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 
TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTGACAAATA ATGGTGTG AT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1 140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGC AT CTGCTCTG AG GCCAG ATGG A GAATACACTT TGCTGGCACC TGTG AATAAT 1 260 
GCATTTTCTG ATG ATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAG AAT 1 320 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGG AAACC 1380 
ATCGGAGGCA AACAG CTC AG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGG AG ACTGG 1620 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
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TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860 
TCTGACATCA TG ACAACA AA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA i 920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTG A AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 
GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACA A GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCG7T CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 
CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACM 1 1 TATA 3000 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA G AATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTT AA 3 1 20 
TCTCAAACGT TTCAATAAAA CCA 1 11 11 CA GATATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 62 Protein sequence: 
Protein Accession U: NPJJ06466 

1 11 21 31 41 51 
(till! 

MtPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 
KLREEIEGKG SFTYFAPSNE AWDNLDSD1R RGLESNVNVE LLNALHSHMI NKRMLTKDLK 1 80 
NGMIIPSMYN NLGLFINHYP NGVVTVNCAR DHGNQIATN GWHVIDRVL TQIGTSIQDF 240 
IEAEDDLSSF RAAA1TSD1L EALGRDGHFT LFAPTNEAFE KXPRGVLERF MGDKVASEAL 300 
MKYH1LNTLQ CSESEMGGAV FETLEGNTIE 1GCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLLKULQ NHtLKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVC1EN SCMEKGSKQG 480 
RNGA1HIFRE IDCPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540 
FKGMTSEEKE ELIRDKNALQ NI1LYHLTPG VF1GKGFEPG VTNILKTTQG SKEFLKEVND 600 
TLLVNELKSK ESDIMTTNGV IHVVDKXLYP ADTPVGNDQL LEILNKUKY IQIKFVRGST 660 
FKEIPVTVYT TKHTKWEP KDCVIEGSLQ PIIKTEGPTL TKVKIEGEPE FRUKEGETI 720 
TEVIHGEPII KKYTKIIDGV PVE1TEKETR EERIITGPEI KYTR1STGGG ETEETLKKLL 780 
QEEVTKVTKF IEGGDGHLFE DEE1KRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession #: NMJ>20974 

Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons) 

1 tl 21 31 41 51 
I I I 1 I I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCO CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 1 20 
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 1 80 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAQAGG 600 
GCCTGAGCTG CATGAATAAG G ATCACGGCT GTAGTCACAT CTGCAAGGAG GOCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1 500 
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GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1 560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
TTATCACTGT TG AGTTTGAG CTTGAAACTA ACCAAAAGG A GGTG ACAGCT TCTTGTG ACC 1 800 
TGAGCTGCAT CGTAAAGCG A ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1 860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAG AAAA CCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGG AGCA CGAGAACGCT 2040 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 
GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTG ATCGTG GTCCCTGAGA 2640 
TCTTCCTGCC CATAGAGGAC G ACTGTGGGG ACTATCTGGT GATGCGG AAA ACCTCTTCAT 2700 
CCAATTCTGT G ACAACATAT G AAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATG A AGGGAACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 
TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACCTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3 120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3 180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 
G AACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTG AGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTG AG ACCTGGGAGG 3600 
ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

Scq ID NO: 64 Protein sequence: 
Protein Accession U; NP 066025. 1 

1 11 21 31 41 51 
I I I I 1 I 

MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGVQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLD VD ECLENNGGCQ HTCVNVMGS Y ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 1 80 
DHGCSHIOCE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRK LLMETCA VNN GGCDRTCKDT 300 
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 
1QFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Seq ID NO: 65 DNA sequence 

Nucleic Acid Accession #: NM_0072 1 0 

Coding sequence: 1-1869 (underlined sequences correspond to start and stop codons) 
! H 21 31 41 51 

i 1 I I ! 1 

ATGAGGCTCC TCCGCAG ACG CCACATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTT 60 
GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA GAGAGGAGGC CACAGAGAAG 120 
CCGTGGCTG A AGTCCCTGGT G AGCCGG AAG GATCACGTCC TGGACCTCAT GCTGGAGGCC 1 80 
ATGAACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAG A AGCCCAGCAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAGA GGTCCCTGGG GCCAGACACC 480 
CGACCACCTG AGTGTGTGG A CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600 
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAG ATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACC TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCX5GC TGCTGGGGGC CAGCGTGGCA 780 
CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TG AGAAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 
ATCCCCAAGT CCTACTTTGA GCACATCGGT ACCTATGATA ATCAGATGGA GATCTGGGGA 1 140 
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTGTO GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAGAA ATCCTTCGGT 1380 
GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1440 
CTGCACAATG TCTACCCAG A GATGTTTGTT CCTGACCTGA CGCCCACCTT CTATGGTGCC 1 500 
ATCAAGAACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AGAACAACCG CGGGGGGAAG 1560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1680 
CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGG AA 1740 
TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC 1 860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980 
AACCCACTGG GTGGCAAGGC AGGACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 2040 
CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG G AGAATGGCT TG AGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 
AACATAGTGA GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTGAAGCTAC AGTGAGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520 
ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATG A TATATTCTGG 2580 
CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 
A 

Scq ID NO: 66 Protein sequence: 
Protein Accession #: NP_00914 1 

I 11 21 31 41 51 

I i i i i I 

MRLLRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWUCSLVSRK DHVLDLMLEA 60 
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAEUC PFWERPPQDP NAPGADGKAF 1 20 
QKSKWTPLET QEKEEG YKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 1 80 
VUVFHNEAW STLLRTVYSV LHTTPAILLK EIILVDDAST EEHLKEKLEQ WKQLQWRV 240 
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHGWLE PLLARIAEDK TVWSPDIVT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPIK SPTFAGGLFS 360 
IPKSYFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEII PCSVVGHVFR TKSPHTFPKG 420 
TSVIARNQVR LAEVWMDSYK K1FYRRNLQA AKMAQEKSFG DISERLQLRE QLHCHNFSWY 480 
LHNVYPEMFV PDLTPTFYGA IKNLGTNQCL DVGENNRGGK PLIMYSCHGL GGNQYFEYTT 540 
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEE WELAQDQLIR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 

Seq ED NO: 67 DNA sequence 

Nucleic Acid Accession #: NM J)141 12 

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I J I I I 

TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT GACGTAATGC 60 
GCGGAGACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAGACCCGAT CTTACTGCAA 120 
TCTCTAGCGT CCTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 1 80 
CTCTCTCGCC TTCCTATTTC TTTTTTTTTT TTTTAAACAA AAAACAACAC CCCCTCCCCT 240 
CTCCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300 
TCTCTTTTTT AATAAGGGTG GGGGAGGGAA AGGGGGGGGA GGCAGGAAAG ACCTTTTTCT 360 
CTCCCCCCCG CAATAATCCA AGATCA ACTC TGCAA ACAAC AGAAGACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTGACGATTA ATCAACAGAT 480 
GTACAGATCA GCTCTCAAAA TGTCTTCTGT GTCTTCTGAG CGTCTTCTAA GACAATTGCA 540 
TTAGCCTCCT GCTAGTTG AC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 
laCCTTATG A AGTCAATGCT GGGTATG ATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAG AAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAGAATTCT CTGCAGATCA GATGTCAGAA AATACGGATC 780 
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AGAGTGATGC TGCAGAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTG AGCCC TTGAAGTCTC 1020 
CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1 140 
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
AAGTGCCTTC AG ATGGGGGT GTAAG ACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1 260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 
GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGCCCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1500 
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATG 1560 
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620 
GGAACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1680 
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTGAGGT TGCAAAACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1800 
CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1860 
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAGA CAAAATGGTA 1920 
CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTG AG TCATCTAGCT 1 980 
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 
GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGGACT TTGCAGCCCA GAAAAGCACC TTGGAGAAAT TACTTATCCG TTTGCTTGTA 2400 
GAAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAGATG 2520 
TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTACCAAATG TGATTTTATT ACCCAAGTGG AAGAAGAG AT TTCCCGACAC TACAGGAGAG 2700 
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG A ACAGGACAT CACTACAGCC AACGGCGAAG 2820 
AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880 
ATCTGCTAAC TCCAG ACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTO GTG AAGAG AG 2940 
AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG GACCGAGAGT TCCAGTGATG 3000 
ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060 
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 
GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 3180 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTGCCTGAC CACAAAGACC TCTCTCTGGC 3360 
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420 
CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
GAACAAGAAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAG AGA GAAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660 
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 
TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGGAG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTGAAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TAATG ACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020 
TGAGTCACGT GCCTGGCCTA CCAAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080 
ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAG AAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAG AAA GTTGATAG AA 4260 
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320 
TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAGAGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGG AA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTTATG AA 4560 
GACCTATTAA AA AAATACTT CATAG AGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 
GTTATTCTTT CTTTTGATGA GTAGGTTACC AAGATTAAAA AGTG AGATAA ATGGTCAATG 4680 
AGAAAGAATG GAAGATGGTA AACAATC ACT TTTTAAAACC TGTTAAGTCA AAACCATCTT 4740 
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACT GG ACTGGCCA TTGG ACCATT TGTTCCAAAA CCCATAAATT GTTGCCTAAA 4920 
TTTATAATGA TCATGAAACC CTAGGCAGAG GAGGAGAAAT TGAAGGTCCA GGGCAATGAA 4980 
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTGACC 5040 
TAGAAATGCG AG CTGTG GTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100 
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTG ATATTTT 5 1 60 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220 
CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTGTAGTG ATTTCATTTT 5280 
AGCCAGGTAT TTCTTTCTTG TGTGTGATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
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ATAGATGGAT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCATTTCTT 5460 
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTGA CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
GTATGTCCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT TTTTTGCCAC GGAGATGTAG 5880 
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 
AGAACAAATA GGAAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000 
ATAATGGTGC CTG ACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120 
CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAGAT ATCCTTTTGT OTGTGTGTGT 6180 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 
GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATG ATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420 
CAGAAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTGCATTCTG 6480 
TAGTCGCATT TGCACTCC AT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 
TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600 
AGAGTTCAAC CAGTTCGTG A CTGTTGTAGC G AATGAAGTT AAAAAAATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAAAAAA GAGAAAAGAG 6720 
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 
AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 
GGATGCATCT CAAAATGTAT AGCCAGACTT GAGAGGTGAC AATTAAAG AT CTAAAAAAGA 6960 
GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 
TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAGA 7080 
AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTCATTAAG 7140 
CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
ATTTTCATTC AATTTTTTTT TTTTCTCAGA CAGACATAGT AGTATCAACT AGCATTGGAA 7260 
AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 
CATTTTTTGC TTTCATTATT ATACATATTT TGGTGGAG AA GAGGTTGGGC TTTTTTGAAA 7440 
GAGACAAAAA TTTATTATAA CACTAAACAC TCCTTTTTTG ACATATTAAA GCCT TTAT TC 7500 
CATCTCTCAA G ATATATTAT AAAATTTATT TTTTTAATTT AAGATTTCTG AATTATTTTA 7560 
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGG AA CTTTTTTTAA TGAGGAATTT 7620 
CATGATGATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTG ATGA AAATGATGTG 7680 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT G AAAATATTT TAAAATTATT TGTCTATATT 7740 
CTAAATTGAG CTTTGGATCA AACTTTAGGC CAGGACCAGC TCATGCGTTC TCATTCTTCC 7800 
TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGG ATAGAAA 7860 
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAGAGACAC TACATGACTC 7920 
CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTCTCTAC ATGACATATT GAG ATTTTTT TTAATCAACT TTTAAGATAG 8040 
TGATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTTATAA AACAAGCATG 8 1 00 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATACATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 
CTCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAG ACCT GTGAACTGCT 8400 
GCCCTGTTTA OTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CA TCCAG TGO 8460 
TGGAAATGTG GAGTAAAG AA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTAAGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 8580 
AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTOAC 8640 
TGTCCAAGGC ACAAGAGAAA ATTGAGGAAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700 
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAG GCAAC T GCAAATGAGA 8760 
AGAAGATAGA AGGACGCCCG TCGGGACTTT GGAGGGCATT GTTATTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880 
TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 
AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAGAAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 
GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTT ATCAGA GAAAATTG TC 9180 
CTTCTAGAGC ATAG AGTCCC AAAACTCAAT TCTGGTTTTC CCCTGTTTTT 111 11 1111 1 9240 
TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 
CCTGTAAAAT TGAAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGGTAATGTA 9360 
GTACTTGTTT GTAAAGCACT TTG AG ATCCT TGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480 
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 
TGAAGATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600 
GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGGA ATGATTTCAT GTTGCTAAGA 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTG A ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840 
TCTTACAACA CATG AAATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900 
TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A 
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Seq ID NO: 68 Protein sequence: 
Protein Accession #: NPJ)5483 1 

1 II 21 31 41 51 
I I I I I I 

MPYEVNAGYD FTNMVRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS A VLSEKAGFN YESPSKGGNF PSFPHDEVTD 1 20 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 1 80 
QSGQANCQGL SPVSVASKNP QVPSDGG VRL NKSKTDLLVN DNPDPAPLSP ELQDFKCNIC 240 
GYGYYGNDPT DLIKHFRKYH LGLHNRTRQD AELDSKILAL HNMVQFSHSK DFQKVNRSVF 300 
SGVLQDINSS RPVLLNGTYD VQVTSGGTFI GIGRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360 
TELEQHFLQT HPNKIKASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDKJTVKAGD 420 
DTPVGYSVPI KPLDSSRQNG TEATS YYWCK FCSFSCESSS SLKLLEHYGK QHGAVQSGGL 480 
NPELNDKLSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NMVTSYNCQF 540 
CDFRYSKSHG PDVIWGPLL RHYQQLHN1H KCTHCHCPFC PRGLCSPEKH LGE1TYPFAC 600 
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSDGQ QSVKESKEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC SFTAADTQSL 720 
LEHFNTVHCQ EQDITTANGE EDGHAISTDC EEPKIDFRVY NLLTPDSKMG EPVSESVVKR 780 
EKLEEKDGLK EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGLAV ETK.GFLQGAP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLNIIK QNNGEQIIRR 960 
RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1020 
GSLTKSHSAQ QPVLVSQTLD 1HKRMQPLHI QBCSPQESTG DPGNSSSVSE GKGSSERGSP 1080 
IEKYMRPAKH PNYSPPGSPI EKYQYPLFGL PFVHKDFQSE ADWLRFWSKY KLSVPGNPHY 1 140 
LSHVPGLPNP CQNYVPYPTF NLPPHFSAVG SDNDIPLDLA DCHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPL NVVKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSGPFQCSI 1260 
CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

Seq ID NO: 69 DNA sequence 

Nucleic Acid Accession #: XM_073879 

Coding sequence: 1-3 8 7( underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCG AA AAAACCACCT 60 
GAAGTGAAAA TGTTTGGAGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 120 
GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAG ACACTG TTCTCTTCG A GAGTTTATAC 1 80 
CACTGCGGCT TTGAACACGG GAGCGTGATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 
GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCGACTCCT 300 
GTCTGTCCTG CACACCACTC AACACAG AGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAACTAA 

Seq ID NO: 70 Protein sequence: 
Protein Accession U: XPJ)73879 

1 U 21 31 41 51 
I 1 I I I I 

MGFGDQGTVE GSLGTSKKPP EVKMFGASQG LLTMETNQSL AQGTGCSWK VDTVLFESLY 60 
HCGFEHGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTH LQ A AN 

Seq ID NO: 71 DNA sequence 

Nucleic Acid Accession 8: AB 033064 

Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I 1 I 1 

GGGGAACAGC AAATTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CITITCTCTT 60 
CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCCATGT 1 80 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
GAGTGACAAT AAATTAGCCC CTTCTTTTTT CCCTTGTCAT TCAGGCCCCT TTTCCTCTCC 300 
AGAGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 
GAAACACACT AAAGTGGACA TTATTTG ACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTGG CACATGGAG A AACATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 480 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540 
CCCTCTCTAT GAGCTGAAAG ACTGACCAAC CATGAACACT AGTAGGGGAT GGGGAAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660 
TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA GAGAGAGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 7S0 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGATGTG TAGTTCTGTG 840 
G CTGC CAAGT TGTGGTTTTT GACAGATCGT CGCATCAGGG AAGACTATCC TCAAAAAGAG 900 
ATTTTACGAG CATTGAAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 
ATGGATGAGG TGGTGCTGAC AATOGAGCAA GGAAACCTGG GTCTGCGGAT CAATGGAGAG 1020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTGACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1 1 40 
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CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTC AAGAG TT GGCTGGCCAT 1200 
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATG ATT 1 260 
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320 
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1380 
CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAGAGT CTCATGGACG GGATGTACGT 1440 
GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1500 
ATCCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTA ATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 
GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740 
CTTCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1800 
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 
GACTGACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040 
ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGA GGAA C 2280 
AAATACTTTT TTTTTTTCAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340 
GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGf C ATCAOTGAGG 2400 
AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520 
GGATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 
CCCAATAA AG GCTTCTTAAT GAGAAAATTG GCATGTTTGC ATGATG AAAT GGAAATGAAC 2640 
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA T GTAGATA AA ATTACACTAG 2700 
TTTAAAATAT GTGCATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760 
TCTGTTAATG TTG CTTTTTT TTTT TTTTTl TAATACATGC TAGTCTAACA TTTCCTGCTC 2820 
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAA AATG CTACCTTTTT TGTT AACAAG 2880 
ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 
TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCT ATGAG 3000 
TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTGC ATTTTTCACA 3060 
TTTTACGAGG G AGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3 120 
TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 3180 
GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGA CTCCA CT GGGGTAGAGG 3240 
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 
GTTGTTTGTT 1111 1 CCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGGG AT 3420 
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480 
CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540 
AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAA TCCCAT T 3600 
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGAAGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA G GTTTGA TCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGG AAT GTAATAAAAT TTGTTTTTTC 3900 
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020 
TAATTTATCT TAGG ATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 
GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATGACCAT ACTGTGGAGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA G AACTGCTTA ATTAAATTAT CATTCATATG TTCATATAG A GACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320 
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 
ACCAAATTAA CCAACTATAT TATAGGAAAT ATGTGAAATT AGTTCATTAG CTTTATTCAC 4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAGAT TGAGGTAGAG 4560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 
AG AATG TATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT T TTGTA TATT 4740 
GTTAAAATTG TAATTCTAAA TTGTATTTCA A A AATG ATT A TTTCTGATAT TGTTTTTATG 4800 
TCACCCATGA TGAAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CTATTCTATT 4860 
TAAAATTTTT AATAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980 
GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AG AACATATT GAAGGCATTG ACTTTG AAAA TCATCTCTTT TTCTCAAG AA GAAAGCAATG 5 100 
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG G GCTTGTGG G GCTTAG AG AC 5160 
ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT GAGAAGGTTT 5220 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATG TAGGT GCTCAGACAG 5280 
GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AA AAATGTAA TATATTGCAO G ATTTATAAC CAG 

Seq ID NO: 72 Protein sequence: 
Protein Accession #: BAA86S52 



1 11 21 31 41 51 
I I I I 1 I 
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MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAWMDEVV LTIEQGNLGL 60 
RINGEL1TAY PQVVVVRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120 
ELAGHGVPLP DTFSYGGHEN FAKM1DEAEV LEFPMWKNT RGHRGKAVFL ARDKHHLADL 180 
SHLIRHEAPY LFQKYVKESH GRDVRV1VVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240 
LSEQGKQLAI QVSNILGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAGIIA 300 
DYAASLLPSG RLTRRMSLLS WSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTKLPGGL FNMNQLLANE IKLLVD 

Seq IP NO: 73 DNA sequence 

Nucleic Acid Accession ft XMJJ40080.2 

Coding sequence: 159-1 104 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 
I I I I I I 

CTGAGTGGGG GCGGGGACTG CTGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGGC 120 
GTGAGGG AGT GACAGCAGCG CATTCGCGGG ACGAGAGCG A TG AGTGAGAA CGCCGCACCA 1 80 
GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCTCCCTGC agggccctcc agttctctgc CTGCACGGCT GGCTGG ACAA TGCCAGCTCC 300 
TTCGACAGAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACGTTGCCAT GGATTTCGGA 360 
GGTCATGGGC TCTCGTCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GACTTTTGTG 420 
AGTGAG ATCC GAAGAGTTGT GGCAGCCTTG AAATGGAATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAGAT GGTGGATAAA 540 
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGAATCAG ATGAAATGG A GAACTTGCTG 600 
ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 
CACGTGTTCA GCCTGAAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 
GAGGAGTGCG GGGAGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 
GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGGAGT CCCTGTCGTT CATGATAGAC 960 
ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
GTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCTGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1 140 
GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGG AC 1 200 
AGGCCTCACT AGTCTTGAGG CCCAGCCTAG GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260 
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

Seq ID NO: 74 Protein sequence: 
Protein Accession ft XPJM0080. 1 

1 11 21 31 41 SI 
I I I I I I 

MSENAAPGU SELKLAVPWG HIAAKAWGSL QGPPVLCLHG WLDNASSFDR LIPLLPQDFY 60 
YVAMDFGGHG LSSHYSPGVP YYLQTFVSEI RRWAALKWN RFSILGHSFG GVVGGMFFCT 120 
FPEMVDKLIi LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQEPSHVF SLKQLLQRLL 180 
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDFISRELCA HSIRKLQAHV 240 
LLIKAVHGYF DSRQNYSEKE SLSFMIDTMK STLKEQFQFV EVPGNHCVHM SEPQHVASII 300 
SSFLQCTHML PAQL 

Seq 10 NO: 75 DNA sequence 

Nucleic Acid Accession ft NM_005794 

Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
1 I I I I I 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120 
CTTCACTCCT GGGGCCAGCA AG ACCACG AA TGCACCGAGA GG A ATGAACA ACTCTGGACA 1 80 
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240 
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAG ACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 
GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540 
COGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 
GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGG AGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 
TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GG AAGAACTT 1 140 
CAAGGAACAT CATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200 
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CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACTCGG CTCIGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTG A TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAG ACT 1380 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 
CC 

Seq ID NO: 76 Protein sequence: 
Protein Accession #: NP_O05785 

I II 21 31 41 5! 
111)11 

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF A1ARRLARDG 60 
AHVV1SSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AG VNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRG AVILV SSIAAYNPVV 1 80 
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGllKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL 



Seq ID NO: 77 DNA sequence 

Nucleic Acid Accession U: NMJ»2425 

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

I - ! I I 1 I 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGA AG 1 80 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 
TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGG AGA 600 
TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATG AACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 
TGATGTG AAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
GTCCTTCG AT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CC OTTTTT AT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1 140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1 320 
GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
OTTACATTGC JAGGCG AGAT AGGGGG AAG A CAGATATGGG TGTTTTTAAT AAATCTAATA 1 500 
ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATG AA GAATGTTTCT GG AATTCTTC 1 620 
ACTTGCTTTT GAATTGCACT GAACAGAATT A AG AAATACT CATGTGCAAT AGGTG AG AG A 1 680 
ATGTATTTTC ATAG ATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1 740 
CTT 

Scq ID NO: 78 Protein sequence: 
Protein Accession #: NP_0024 1 6 

I II 21 31 41 51 
I I I I I I 

MHLAFLVLLC LPVCSA YPLS GAAKEEDSNK. DLAQQYLEKY YNLEKDVKQF RRKDSNL1VK 60 
KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPK.WRKT HLTYR1VNYT 120 
PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 
FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 
RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 
IRGNEVQAGY PRGIHTLGFP PTIRKJDAAV SDK.EKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRUADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC 

Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: NM JW0493. 1 

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTG A ACTTGGTTCA TGG AGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
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CAGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTAAG AGG AG AGCAAGGT 1 80 
ACTCCTGGTC CACCAGGCCC TGCTGGACCT CG AGGGCACC CAGGTCCTTC TGGACCACCA 240 
GGAAAACCAG GCTACGGAAG TCCTGGACTC CAAGGAGAGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TGAATGGACA GAAAGGGGAA ATGGGATATG GTGCTCCTGG TCGTCCAGGT 600 
GAGAGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGAGTGGGA 660 
AAAAGAGGTG AAAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGGCATTG GAAAGCCAGG AGCTGCTGGA GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 
GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 
GGG AATATGG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1 140 
TCCCCTGGGT CAGATGG AAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1 200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TCCTGGTCTC 1260 
CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTG GGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 
ATGCCTGAGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1620 
AGTGCCAACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA TTTTGTATAA CAGGCAACAG 1740 
CATTATGACC CAAGGACTGG AATCTTTACT TGTCAGATAC CAGGAATATA CTATTTTTCA 1800 
TACCACGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1 860 
GTAATGTACA CCTATGATGA ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG G ATTCCTAGT GGCTCCAATG 2040 
TGAG TACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATGTAATTTT TATTTTCTGA 2160 
AATACAGATT TGAGCTATCA GACCAACAAA CCTTCCCCCT GAAAAGTG AG CAGCAACGTA 2220 
AAAACGTATG TGAAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAGA AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAGA AATAG AGCTC TAAGTTATGT 2400 
GAAATTTGAT TTGAG AAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 
TATGAGAACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAA AGCTC TTATATAAAA AAGCCCCAAA ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
TTTCATCAAT GAACCTTTTC AAAACTTTTC TATGATTGCA GAGAAGCTTT TTATATACCC 2760 
AGCATAACTT GGAAACAGGT ATCTG ACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTG ATATG ATTTTCTGGA TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CTGAGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TATTTTTATA 3000 
TTGCTGTTTA AAACTTTTAA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 
TCCTTATTTA CGACACAATA AAATAACATC AATAGATTTT TAGGCTGAAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCTCA ACCATTCTTT CAAGGCTTTT CATTCGACAC AATAAAATAA 31 BO 
CATCAATAGA Till 1AGG 

Scq ID NO: 80 Protein sequence: 
Protein Accession #: NP_000484. 1 



1 11 21 31 41 51 
1 I 1 I I I 

MLPQIPFLLL VSLNLVHG VF YAERYQTPTG IKGPLPNTKT QFFIPYTIKS KG1AVRGEQG 60 
TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 
VPGMNGQKGE MGYGAPGRPG ERGLPGPOGP TGPSGPPGVG KKGENGVPGQ PGIKGDRGFP 240 
GEMGPIGPPG PQGPPGERGP EG1GKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 
PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFKAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQ1PG1YYFS 600 
YHVHVKGTHV WVGLYKMGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence 

Nucleic Acid Accession 0: NM_000786 

Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons) 



II 21 31 41 51 
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CGCOATTCTC AGGGATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT 60 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 1 20 
ACAGAGTGTT ATTTAAGGGC GTGGCCAGCG GAACATCCCG CCCCATTCTG TGACGCACGG 180 
GGTGGCGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG CCAGCTTCTC TCGTTCCGTC G ATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCG GCGGCTGGGA TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480 
CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAOGG OTGAAAAGTC CTCCATACAT 540 
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600 
TCTAGAAAAT GCATATGAGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAG AC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCACTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTGAATGCA GAAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGG A AGGGAGTTGC 780 
ATACGATGTG CCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATGTTAAAAA GTGGCCTTAA 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTG AAAAA G AAACAAAGG AATACTTTGA 900 
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1080 
GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGGATATTTT 1140 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200 
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAG A AC AGAAAACAGT 1380 
CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560 
TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCCTGGACT TTAATCCTGA 1620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 
TGGGCGTCAT CGTTGTATTG GGG AAAATTT TGCCTATGTT CAAATTAAG A CAATTTGGTC 1 740 
CACTATGCTT CGTTTATATG AATTTG ATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1 800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACG AA GATCAAAAIQ 1860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAGAATGAAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 
AAAGCCAGTT TATGATTTAG GATTTTGTTA ACTGAATGGT TCTATCAAAT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2 100 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAGAGTACTC ATCGCTCTTG CCAAATAAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGGAA ACCACAGTTT 2280 
ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATCG A 2340 
ACATCTAAAG CCTTG AATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAGAT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGG A AGTCCAAAGT CAAGTCCT TA 2640 
GGGGATAATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC 1 1 1 1 1 1 1 1 1 i TTTTTTTTTT 2700 
TTGAGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCA AT TCTCTGCCTC AGCCTCCCAA GTAGTTGGG A 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA GACGGGGGTT 2880 
TCACTATGTT GGCTAGGCTG GTCTTG AACT CCTGAGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
rTTTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACATTCTG T AAAGCACAT TCTGTTTAGG AATCAACTTA 3 180 
TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT GAAAAAATAC CACATCGTCT 3240 
GTTTTCACTA GGTGATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAG AA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTG AAAAAA T 

Seq ID NO: 82 protein sequence: 
Protein Accession*: NPJJ00777 

1 11 21 31 41 51 
I I I I I I 

MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60 
PAG VKSPPYI FSPIPFLGHA IAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 1 20 
ALLFNSKNED LNAEDVYSRL TTPVFGK.GVA YDVPNPVFLE QKKMLKSGLN IAHFKQHVSI 180 
IEKETKEYFE SWGESGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREDCDIF YKAIQKRRQS QEKIDDILQT LLDATYKDGR 300 
PLTDDEVAGM UGIXLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCDCETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRC1GENF AYVQIKTIWS TMLRLYEFDL 480 
1DGYFPTVNY TTMIHTPENP V1RYKRRSK 

Scq ID NO: 83 DNA sequence 

Nucleic Acid Accession it: NMJW655 1 .2 

Coding sequence: 64*336 (underlined sequences correspond to start and stop codoos) 
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I 11 21 31 41 51 
1 1 I I I I 

AATTCTAGAA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60 
ACCATGAAGC TGTCGGTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACCAGGCC 120 
AATGCCG AGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 1 80 
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAG ATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300 
GTCCTGGTGA AAATATTG AA GAAATGTAGT GTGTGACATG TAAAAACTTT CATCCTGGTT 360 
TCCACTGTCT TTCAATG ACA CCCTG ATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Scq ID NO: 84 Protein sequence: 
Protein Accession #: N?_006542.1 

I 11 21 31 41 51 
I I I I 1 I 

MiOSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFfCLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSLIAEV LVKILKKCSV 

Scq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: KM 0023 1 7. 1 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codoos) 

1 11 21 31 41 51 
I I ! 1 I I 

GGGCCAGGAC TGAGAAAGGG GAAAGGGAAG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TGAATCCCAC CCTTGGCATT GCTTGGTGGA G ACTGAGATA CCCGTGCTCC 120 
GCTCGCCTCC TTGGTTGAAG ATTTCTCCTT CCCTCACGTG ATTTGAGCCC CG I 11 11 ATT 180 
TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATGCGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCGCCGG CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC GCCTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 
CCCAGCAGCC CCGCACTCCG ATCCTGCTG A TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCG AAGC TGGGCCCTCG OGCGCGG AGA 660 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACG AC CCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 
ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 
ATTCCTGGGA ATGGCACAGT TGTCATCAAC ATTACCACAG TATGG ATGAG TTTAGCCACT 1 140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATGATA CCTATGGTGC AGACATAGAC TGCCAGTGGA 1320 
TTGATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 
ACCTGGTTCC TGA ATCTGAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TTAGAAGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAGACT AACTTCAGTA 1560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 
TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Scq ID NO: 86 Protein sequence: 
Protein Accession U: NP_002308. 1 

1 U 21 31 41 51 
I i I 1 \ \ 

MRFAWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPG AAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 1 20 
ARHWFQAG YS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 1 80 
NPYYNYYDTY ERPRPGGRYR PGYGTG YFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300 
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTOGLSPG CYDTYGADID 360 
CQWIDITDVK PGNYIUCVSV NPSYLVPESD YTNNWRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #: NM 00641 9. 1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codoos) 

1 U 21 31 41 51 
I I I I I I 

TTCGGCACTT GGG AG AAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
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GCTCAAGTCT OAACTCTACC TCCAGACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TT ACACAAGC 1 80 
TTGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGQ GAATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAG AAG 300 
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTG AG AAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAG AG AAA GATTCCCTGA 420 
TGCTGATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGG AG AAAG AACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTT CACT CA 600 
CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
AAAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTTTAAAAT TTCTTAGAAA ACAATGG AAT GAG AATTT AA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GGGAGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 
TAG ATATCCT CTGCTTAAAA ACTCACTACG GAGGAGAATT AAGTCCTACT TTTAAAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080 
GACAAACTTC TTTCACTCAC ATCTTTTTCA CTGACTTTTT TTGTGGGGGC GGGGCCGGGG 1 1 40 
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 
AGCAAACATT TTACTT 

Seq ID NO: 88 Protein sequence: 
Protein Accession U: NPJJ06410.1 

1 11 21 31 41 51 

MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQ1LPRGNGC 60 
PRKEIIVWKK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

Seq ID NO: 89 DNA sequence 

Nucleic Acid Accession #: NM.002652 

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 120 
CAGGACAACA CTCGG AAG AT CATAATA AAG AATTTTGACA TTCCCA AGTC AGTACGTCCA 1 80 
AATGACG AAG TCACTGCAGT GCTTGCAGTT CAAACAG AAT TGAAAG AATG CATGGTGGTT 240 
AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTG ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTG AAA TCCTAAAGGT AGAATAATGG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGGAATTTC 540 
TGCTGTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 

Seq ID NO: 90 Protein sequence: 
Protein Accession #: NP JJ02643. 1 

1 II 21 31 41 51 

MRLLQLLFRA SPATLLLVLC LQLG ANKAQD NTRKIIIKNF DIPKSVRPND EVTAVLAVQT 60 
ELKECMVVKT YLISSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAWDVIREL 120 
G1CPDDAAVI PIKNNRFYTI EILKV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession U: AK00034 1 

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GATAGCGCCG GGCAGAGGGA CCCGGCTACC CTGGACAGCG CAJCGCCGCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG GATCATGGAA CATCTAAAGG CCTTTGATGA TG AAATCAAT 120 
GCTTTTTTGG ACAATATGTT TGGACCGCG A GATTCTCGAG TCAGAGGGTG GTTCACGTTG 1 80 
GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 
AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTGG AATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTGA CATC CGGG TA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 
TTCGTTTTGC GG AAAAAAAC GAGTCAG ATT ACTTTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTG AAC TGGATACCTT GTGGACAAAG TTTCTTTGGA 600 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGG ACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGG AAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTGG CTTC CCCTT C 780 
GGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 
TATGTTCAGA CATACCGAAA AAAG CC AATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTG A AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTG ATG AAC 960 
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AAGAAAGCAC AATAAAAATC AGTAACAGAA AAAGCACATA TACTAGCCTA ACA GATTG GC 1020 
TTGTTTTAAA GCAAAGACTG AATTGAAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1 140 
AACTGAATGC TTTGATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
5 CAGCTTCTCT CTCCCTCCCT CCCACAG ACT G AACCTTTCG CCAGAAGCTG TCCTTATAAC 1260 
GCCTTATACG CATACACAGC CAGG AAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAATTA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 

1 0 CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAGATTCCC ACAACCACTC 1560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG 1740 
GTTATACAGA TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 

15 TTCTGAGCAA GCACTGGAGT CCACTCCGAC CTTTTTCTTT G AACATGCAT GCTGCTGGAA 1860 

TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTGA GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGAGTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 

20 CAGCACTCCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAGT 2160 
AGACCAAAGT ATTAATG AGT ATTTCCTTTC TCCATAAGTG CAGGACTGTT ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AG ATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATG AAGTCA AAAAAAAAAA AAAAAAAAAA 

25 Seq ID NCh 92 Protein sequence: 

Protein Accession #: BAA9 1 096. 1 

I It 21 31 41 51 

30 MEHLKAFDDE WAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYLLSIW LGNKYMKNRP 60 
ALSLRGILTL YNLGITLLSA YMLAEULST WEGGYNLQCQ DLTSAGEADI RVAKVLWWYY 120 
FSKSVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWC V LNWIPCGQSF FGPTLNSFVH 1 80 
ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTTTHT MSAWKPCGF PFGCLIFQSS 240 
YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ 



35 



Seq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM_000044 

Coding sequence: 1 1 15-3874 (underlined sequences correspond to start and stop codons) 



40 1 II 21 31 41 51 

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGA A AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGG ACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGC C TCTT GCAGCG 1 80 

45 CGGCGGCTTC GAAGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTGAAG TTTT TAAAA G 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 

5 0 TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TTGCAAAGA A GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGGAGAGA 600 
ACCCTCTGTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCG AGTGCGG 660 
AGCAG AG ATC AAAAGATG AA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAA AAGCCG AAAT AAAAGAAAAA GATAATAACT CAGTTCTTAT TTGCACCTAC 780 

5 5 TTCAGTGGAC ACTG AATTTG G AAGGTGGAG GATTTTGTTT TTTTCTTTTA AGATCTGGGC 840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGC AGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA G AG AAGGG G A GGCGGGGTAA 1080 

60 GGG AAGTAGG TGGAAG ATTC AGCCAAGCTC AAGGATGG A A GTGCAGTTAG GGCTGGGAAG 1 140 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAG AATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATCCAG A ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GCGCAGCACC 1260 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1380 

65 GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TO AGGAACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
ACCTCCGGAC GAGGATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 

70 GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCG AA GGCAGCAGCA GCGGGAG AGC 1 740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG GTGTCCATGG GCCTGGGTGT 1860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTTTGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1980 

75 AGGTTCTCTG CTAGACG ACA GCGCAGGCAA GAGCACTGAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGG A GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
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AGCACTGOAC GAGGCAGCTG CGTACCAG AG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGGACCG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGGACCT 2340 
GGCGAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
TTCCTCATCC TGGCACACTC TCTTCACAGC CG AAGAAGGC CAGTTGTATG GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCCGCGGC GGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTOAG 2640 
CAGAGTGCCC TATCCCAGTC CCACTTGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATGCGTTT GG AGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC TTTCCACCCC AGAAGACCTG CCTGATCTGT GGAOATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAAA TGATTGCACT ATTG ATAAAT TCCGAAGGA A 2940 
AAATTGTCCA TCTTGTCGTC TTCGG AAATG TTATGAAGCA GGGATGACTC TGGGAGCCCG 3000 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120 
TCAGCCCATC TTTCTGAATG TCCTGG AAGC C ATTG AG CCA GGTGTAGTGT GTGCTGGACA 3 1 80 
CGACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CGTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGGATGCTC TACTTCGCCC CTOATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCG G ATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA GAGTTTGG AT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGGAACTCG A TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACCAGCT CACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCG AGAGAG CTGCATCAGT TCACTTTTG A CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GG ACTTTCCG G AAATG ATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TTTCCCCACC 3900 
CCAGCTCATG CCCCCTTTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGAATTTCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCTTTTTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCCTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCACA CA AACGTTTA CTTATCTTAT GCCACGGGAA 4260 
GTTTAGAGAG CTAAGATTAT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 

Seq ID NO: 94 Protein sequence: 
Protein Accession ff: NPJW0035. 1 

1 11 21 31 41 51 
I I I I I 1 

MEVQLGLGRV YPRPPSKTYR G Af QNLFQSV REVIQNPGPR HPEAASAAPP GASLLLLQQQ 60 
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAURR GPTGYLVLDE EQQPSQPQSA 120 
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK. 1 80 
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LG VPPAVRPT PCAPLAECKG SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
DYYNFPLALA GPPPPPPPPH PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420 
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGOGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLP1D YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600 
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETTQKLT 660 
VSHIEGYECQ PIFLNVLEA1 EPGVVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720 
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYIKELDR 840 
UACKRKNPT SCSRRFYQLT KLLDSVQPLA RELHQFTFDL LIKSHMVSVD FPEMMAEIIS 900 
VQVPKILSGK VKPIYFHTQ 

Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession ft: NM_002497 

Coding sequence: 1 35-1472 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I i I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120 
GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGG ACTATGA AGTGTTGTAC ACCATTGGCA 180 
CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATGAT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGG AATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 
TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGG AG A CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 
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TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATG AA TCGCATGTCC TACAATGAGA 720 
AATCAG ATAT CTGGTCATTG GGCTGCTTGC TGTATG AGTT ATGTGCATTA ATGCCTCCAT 780 
TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
5 ATTACCATCG ACCTTCTGTT GAAGAAATTCTTGAGAACCC TTTAATAGCA GATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AG AG A AG AAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1 140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG ctagagcaga AAATCTGTTG AAGAACTACA 1200 

10 GCTTGCTAAA GG AACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTG AAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAG A TATTGAGAAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCTAGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

1 5 AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

20 TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGT GGTA TGCTTACAAT 1860 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTG AG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTG AA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTTTCCCACTAAA 2100 

25 T AAA AG A ATT CTTCAGTTA 

Seq ID NO: 96 Protein sequence: 
Protein Accession U: NP 002488 

30 1 11 21 31 41 51 

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKJN ASEHOMOSAP IENSMPSRAE 60 
DYEVLYTIGT GSYGRCQK1R RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLRELKHPN 120 
IVRYYDRIID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 180 

35 CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240 
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAFSQKELAG KIREGKFRRI PYRYSDELNE 300 
IHRMLNLKD YHRPSVEE1L ENPLIADLVA DEQRRNLERR GRQLGEPEKS QDSSPVLSEL 360 
KLKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASKPELLNLP SSVIKKKVHF SGESKENIMR SENSESQLTS KSKCKDUCKR LHAAQLRAQA 480 

40 LSD1EKNYQL KSRQILGMR 



45 



Scq ID NO: 97 DNA sequence 

Nucleic Acid Accession*: NM_007050.2 

Coding sequence: 185-4576 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CCTCCCGCCT CAGTTCGCGC CGCGCCTCGG CTTGGAACGC AGGAGCGCCG GCTCCGGGAG 60 
CCCGAGCGG A GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCG ATGCG GGGCCACCCC 120 

50 GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGGAAGGG GCTGAGCTGC CCGCCGCCGC 180 
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGG AC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGGAGAAA CCAATGCTGG ACCAGGCAGT GCCCACAGGA TCTTTCATGA TGGTGAACAG 420 

55 CTCTGGGAGA GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTGA AGGAGAATGA 480 
CACCCACTGC ATCG ACTTCC ATTACTACTT CTCCAGCCGT G ACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTGAAGG TGAATGGTGG CCCCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTGATATTTG AATCCGTCTC ATTG AAGGGT CATCCTGGCT ACATCGCCGT 720 

60 GG ACGAGGTC CGGGTCCTTG CTCATCCATG CAG AAAAGCA CCTCATTTTC TGCG ACTCCA 780 
AAACGTGG AG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTGCTG GTGGG AAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCAACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTGAT GGTGGGTCTG GTGTGTCCAA 1020 

65 CTACGCGGAG CTGATCGTGA AAGAGCCTCC CACGCCCATT GCTCCCCCAG AGCTGCTGGC 1080 
TGTGGGGGCC ACATACCTGT GGATCAAGCC AAATGCCAAC TCCATCATCG GGGATGCCCC 1 140 
CATCATCCTG AAGGAAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AGACCCACAT 1200 
AGTCGACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCACCAC 1320 

70 CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATCG TAGACATCAG 1380 
AGCCCGGCAG CTG ACCCTGC AGTGGG AGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG CGCCCCTTCA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGG AG AGCGAGG AGC TGGTGGTGCA 1620 

75 GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CGAG ATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1 800 
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GGGGAAAGTO TTCAAGCTCC GGAATG AAAC CCACCACCTC TTTGTGGGTC TGTACCCAGG 1 860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT G AGTACG ACA CAGACACCCC 1 980 
ATTGAATGAG ACAGACACGA CCATCACAGT G ATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
5 TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100 
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CCAGCCTCGA 2160 
TTCTCTACAC TACTTTGCTG CTGAGTTG AA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAGA CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CA A AGCCAAT GGAGAGACCA AAATCAACTG 2340 

1 0 TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 

1 5 GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGGACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
G ATGAGCTAC CCCCGGG ACC AGTTCCAACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 

2 0 AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGG AT G AAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTG ACGGA TACCATCGAC CTCGGCACTA 3 1 20 
CATTGCG ACT CAAGGTCCG A TGCAGG AG AC TGTAAAGG AC TTTTGG AGAA TGATCTGGCA 3 1 80 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA GGGTGAAATG 3240 

2 5 TGTGCG ATAC TGGCCAGATG ACACGG AGGT CTACGGAG AC ATTAAAGTCA CCCTGATTGA 3300 

AACAGAGCCC CTGGCAG AAT ACGTCATACG CACCTTCACA GTCCAGAAG A AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 

3 0 TG ACACCATG CTTG ACATGG CCG AGAATG A AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 

TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 

3 5 GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGCAAT T ACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTG AATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 

40 GCCCATCCAG GTGOAGTTCG TCTCCGCAGA CATCG ACQ AG GACATCATCC ACAGAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTG GCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTG AGAT 4440 

45 GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGG AGACCC TGGAACAGTA TAAATTTGTA TACG AGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACC TTCTGA 4740 

5 0 GGGGTCTCCT GTTGCCGTGG G AGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGG ATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 

5 5 CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACAAGAG AG GCCCCCCATC CTCACACCTA ACCTGCATGG 5 1 60 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTGACC GT CCAGCA TT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTG ATAAA GGTGTTCTTT GCTTTTTTGT 5280 
_ GTGGTCAGTG GGAGGGGGTG G AACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 

6 0 AAAGGGAOCA CCTCCCTGGG GCAGGGCTTG GGCTG ACCTG TAGG ATGTAA CCCC TGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAAC AAG ATA AGATTTG ATT ATTTTCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCT TAT GTCAA AA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 

65 AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GG ATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 

70 CCTTAGAACC CCCAGGAGAG AGCTACTGAC TTGTTATCAT CTCCATTTGA TCATCTCCTC 6000 
CAATGAGGAA ACCCACGCAC CTTCC TTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GGAAATTTAG 6180 
TACTTTCTTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 

75 CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT GAATGG ACCA GCCAGGGAAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 
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TTGAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 6480 
TTTGGCCCTC AAGAGCCCAC TGTATGCTGG GGAAATGGAA CCATGGTGCA GTAGTGTGGA 6540 
GTGGATGAGT GTTCCATG AG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG GGCTTCCTGG 6600 ' 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660 
5 CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
GTCTGGGGG A GGCC AGCCCA GTGCGGGCAC AGCACAACAC GGGGAGCAGC AAGAGATGAG 6780 
CCAGGGTCCA GGAG ACAG AT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGGA GCTTTAGAGA G ATGCACTCA GCTTCGTGTT GGCCAAGACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 

1 0 CTTTTTAATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 

GCAATGGATC ATGAGACTCA GATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGGAG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGCCGACAAA 7140 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 

1 5 AAGGAACCCC ACG ATGAACT TGCTCTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCA AATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500 
AACACAAATC CTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 

20 AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAGAAAC 7620 
AAACTTGCAA AAAAAAAAAA TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTGAT 7680 
GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGGACA 7740 
GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATGAC 7800 
TTCCCTAAGA TCCCATGGGA CAAGG ATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 

25 CCCAGGAATT TGAACATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTGTCA TG AGCAGAGG 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCTGTTTGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CCACCTGTGT 8100 
TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8160' 

3 0 GGGTGAAGAA TCTGGGCCA A CCAGGGGTCC TTGGGCCTCT AG AAGGCCAC AGTAGGCCTC 8220 
TCTTTGTGGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
CCTGCAAAGG AACCAACAG A CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGG AGG GTTTGTGATT GTCACCATTT ACTGATG AGG AAACTAAGGC 8400 
TCCTAGGGGA G AAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 

3 5 TAAACCGGTT TTTTCTCACT ACAGAGACAA TATTTTTCCA CCATTGTATC TCACATTTTT 8520 

CCCAGGAGGT TACCCATAAC AGAAGAGACT AGAGTGGAAC AGATACGTCA GTGGATAAAG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 
TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 

40 TATGACGGAC CCTAGGTCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAGCAG 8820 
TG ACCCTGCT GCCTCCCAG A GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATG AGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
. _ ATTTGCTCAT CTGGCGTGGA G AAGG ATG AG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 

45 TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9 120 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CCTAGCACCT TGGAGGGGGC ACTATTAGG A AATCGAGATT ACTATGGCAC AATTAATTCC 9300 

_ TGGGTAAGGC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 

5 0 ATACATG AAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAG ATTTGG 9420 
AGCACTTTAG GCAGG AAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AGAACCGAGG ATGCAGATG A CTTATTTAGC AAAATGGTCA CTTCTGTGAC ATAGCTGGAG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
_ GCAAAAGATG ATGCATTTCC CTCTGACCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 

5 5 CCACACTAGC TCTTGCTGTA AGTCCTCACC AGGATCTACA TTTCCTCGTC GCTGGTGGG A 9720 
ACCCCTTAG A GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT GCAGG AGTGA CTGCCCAACC 9840 
CAGATCCAGA GACATTG AAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATG AAGA AAG CC ACCCTTCC TT AGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 

60 CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 1 0020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CTTCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 

65 AAATCCAACT AG AAGAG ACA AGCTG AG ATT CAGATTTGTT TACTCCTCCC ATGCAAAGTT 1 0320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
TGGATCTGAT CCTCCG AAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 

70 ACTCTTCTAG TTTTTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCA CTGG GGC AGATACAAGG G AAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 

7 5 CTATTGGG AA G AGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 1 0920 
GCTCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT G AAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 1 1040 
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TTAATGTAGT TCATGGTTCC TTAGTGAGCA GGACTCTTGG ATGTGGAGGA GAAAGGGTCA 1 1 100 
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT ! 1 160 
GGGCTTATG A ATACAAATCA GTGGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 1 1220 
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 11280 
5 GCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAGTTAGGC 1 1340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 11400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 1 1460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCCCA 1 1520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 1 15S0 

1 0 AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGG ATCATGT 1 1 640 
TTGAACTCAT TCATACCTGT AGAGCACGA A TCCAAGTAGA TTGTGTTTGG TCTGTACAGG 1 1700 
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 1 1760 
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 1 1820 
CTGCTTTGCC ATAGACCCAA TACCCATGTA GATCTCTCAT G AGAGCAGCC ATAACTCAGA 1 1880 

1 5 CCCACTG ACC AACAGGGCCA TGAGTGACAG CCAGAACCAG TGAAGGTCCA AGTAGGACAC 1 1 940 
AGAGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 

20 CTAGCCAATC AAAG ACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 1 2240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTTTTTTC 12480 

25 TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACA ATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

30 Seg ID NO: 98 Protein sequence: 

Protein Accession #: NP JXJ898 1 . 1 

1 11 21 31 41 51 

3 5 GIREFNPPRO TEINTYROSI NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 

LAALALS1XL RLQLPPLPG A RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 1 20 
MLDQAVPTGS FMMVNSSGRA SGQKAHLIXP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 180 
VKVNGGPQGN PVWNVSGVVT EGWVKAELAI STFWPHFYQV DFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC IAGGKWSQHD KXWLQQWNGR DTALMVTRW 300 

40 KHRRFSATVS VADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTPIA PPELLA VGAT 360 
YLWTJCPNANS IIGDGPIILK. EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV EIVDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQYVFNQQ QYEAEBVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELVVQTEED 540 
VPGAVPLESI QGGPFEEKIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600 

45 KLRNETHHLF VGLYPGTTYS FTDCASTAKG FGPPVTTR1A TKISAPSMPE YDTDTPLNET 660 
DTTITVMLKP AQSRGAPVSV YQLWKEERL QKSRRAADI1 ECFSVPVSYR NASSLDSLHY 720 
FAAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYS1 YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAGVIAGL LMFUILLGV 840 
MLTIKRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 

5 0 SSSQDVNGFT DGSRGELSQP TLTIQTHP YR TCDPVEMSYP RDQFQLAIRV ADLLQHJTQM 960 

KRGQG YGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTLIETEPL AEYVIRTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1 140 
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAlDTML DMAENEGWD IFNCVRELRA 1200 

55 QRVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NTSRLDPQTN SSQIKDEFQT 1260 
LNIVTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL 1SVDGESSNY INAALMDSHK 1320 
QP AAFVVTQH PLPNTV ADFW RLVFDYNCSS VVMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED IIHRIFRICN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLUCVVRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NI1DVFHJVK TLRNNKSNMV 1 500 

60 ETLEQYKFVY EVALEYLSSF 

Scq ID NO: 99 DNA sequence 

Nucleic Acid Accession »: NMj002988. ] 

Coding sequence: 71-340 (underlined sequences correspond to start and stop codons) . 

65 

I 11 21 31 41 51 

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 

70 CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTG AAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTG AA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 

75 CCACCCTGG A GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT G ATTTC TATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATG ACATTG TCTTTATCAT CCTTTCCCCT 600 
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TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTG A 780 
^ AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence: 
Protein Accession ft: NP_002979.I 

I 11 21 31 41 51 

10 | | | | i | 

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQI? QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDUCLNA 

Seq ID NO: 101 DNA sequence 
15 Nucleic Acid Accession U: NM 01 5507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 
on i I I I I I 

ZU CCGCAG AGG A GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAG ACTGCA GGGACAGCAC CCGGTAACTG 1 20 
CGAGTGG AGC GGAGGACCCG AGCGGCTGAG G AGAGAGGAG GCGGCGGCTT AGCTGCTACG 1 80 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
25 GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG G AAGCTACAA GTGCTTTTGC 600 
3 0 CTCAGTGGCC ACATGCTC AT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAG ACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTCGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
3 5 AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAG AACCCA CCAGGACTCC TACCCCTAAG GTG AACTTGC AGCCCTTCAA CTATGAAGAG 1 200 
40 ATAGTTTCCA GAGGCGGGAA CTCTCATGG A GGTAAAAAAG GGAATG AAGA G AAAATGAAA 1 260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCG AG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 
45 TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1 560 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTOGAGTG 1680 
TTTGTGAAAA ACAGTAACA A TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 
TGG AAGACAG GGAAAATTCA GTTGTATCAA GG AACTGATG CTACCAAAAG CATCATTTTT 1 800 
5 0 G AAGCAGAAC GTGGCAAGGG CAA AACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1 860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGA CT GAA TGTTACT ATCTTTATAT 1920 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2 100 
5 5 TTTCTG AATC TTTCCACATT ATATTATAAA ATATGG AAAT GTCAGTTTAT CTCCCCTCCT 2 1 60 
CAGTATATCT GATTTGTATA AGTAAGTTG A TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAG AAAAAAA AGCACAGAG A AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 
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Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP 056322.2 



1 11 21 31 41 51 

o5 | I | I I I 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
GKVICPYNRR CVNTFGSYYC KCHIGFELQY 1SGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

70 GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKDCNVT 300 
PEPTRTPTPK. VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK. EGLEDEKREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKJQLYQ GTDATKSIIF EAERGKGKTG EIAVDG VLLV 540 

75 SGLCPDSLLSVDD 



Seq ID NO: 103 DNA sequence 
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Nucleic Acid Accession fl: NMJJOl 565. 1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 
1 It 21 31 41 51 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACCATG A ATCAA ACTGC G ATTCTG ATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 1 20 
ATTCAAGG AG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 460 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTG A GGTGCTATGT TCTTAGTGG A TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TT TGGGG TTT ATCAGAATTC 720 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAA AG ACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

Seq ID NO 104 Protein sequence: 
Protein Accession #: NP 001556.1 

1 It 21 31 41 51 

MNQTAILICC LIFLTLSG1Q GVPLSRTVRC TCISISNQPV NPRSLEKLE1 IPASQFCPRV 60 
HIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 

Nucleic Acid Accession #: NM_0l 5068.1 

Coding sequence: 1 1 70-2243 (underlined sequences correspond to start and stop codons) 
I II 21 31 41 51 

GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA GAAGGGACG A GCTCTCTGAA GAGATCAACA ACTTAAGAG A GAAGGTCATG 1 80 
AAGCAGTCGG AGGAGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAGAAC 240 
ACCACCCTTC GAG AGCAAGT GG AACCCACC CCTGAGGATG AGGATGATGA C ATCG AGCTC 300 
CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAG AC 360 
CTCCCAGAGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TGACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATG A AGCATGTCTT TGAAGACCCT 600 
CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCAATGCTTT CCAGATGATT GCCCAGGACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT GGGCAGTGCA TTCACATTGA GAGAAGGCTG 840 
GCCAGGGCTG CTGCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACA ATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1 140 
ATGATGCCTC ATCTCCACAC TTG C AAGTG ATGCTCC AG AT TCATCTTCCG GGCAGACACA 1200 
CCCTGTTCGT CCGAGCCATG ATCG ATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
CTATCGTCTT TG ATTCTG A A TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1560 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GG ATACAGAG 1 620 
TTTACCAACC AGTG AGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATG AGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCAC ACAGT A 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 
TGGCAAGAAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1860 
TTCTCCAGGT GAAGAGGGGG TGGAA ACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTG AA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCG ACCTAC CCAGTAGG AT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
AAGGAAG ATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
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TTTATTCCTG TTCAGCTTCT CAATCAGTGA CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGCA CCAAAAGACC TTCAACATTT TCTGAGAGCA ACAG AGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTGACTGCCA ATCTAACTAA AATAATTAAT 2520 
5 AAGTTTACTT TCCAGCCAGT CCTGGAAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTAAATTA TAGGCTGCCA A ATTTGCTGT TTAACATTT A CAGAGAAGCT GATACAAACG 2640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG GAGGAGGACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTGATCCTCT CGCTGCACCC TTAGAAACTT 2820 

1 0 CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 

GGTTCTCCCA GGAGCAGGAA AAACCCTTGT GACATG AAAC ATCTCAGGCC TGAAAAGAAA 2940 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATGACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCACAAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3 120 

1 5 TTCATGAAAT GTGACCACCT CTCAATCACC TTTG AGGGCT AAAG AGTAGC ACATCAAAAG 3 1 80 
GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATGAGTGT TCTTAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
ATTAGGATGT GTGG AGAG AG G AGAAATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 

20 GAAGGGGGAC ATCTTTG AAG AAGAGGAATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAAAAGGAT CAGAATCCAC 3540 
GTAATG AAAA AGGTCCCTCT AG AGG ATGGA GCTG ATGTGA AGCTGCCAAT GGATGAAAAG 3600 
. CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCAGCCC 3660 

_ AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGGACTGCC TGACTTCCTT 3720 

25 TAGCCTGGTC CCTTGCTACT ACCTTGAACT GTTTTATCTA ACCTCTCTTT TTCTGTTTAA 3780 
TTCTTTGCTA CTGCCATTGA CCCTGCTGCA GG ATTTGTGT CATTTTCCTG CCTGGTTGCT 3840 
GAGACTCCAT TTTGCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 3900 
CAGTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
TCTAACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATCCGC 4020 

3 0 TCTTATTCTT GCATGTTTAA AC ACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 

GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GCACTATCCA GTTCTTATTG 4140 
TTAAACTAAG TAAAAATGAT AAGTAACATA GTGTAAAATA TTCCTTTACT GTOAACTTCT 4200 
TACAATGCTG TGAATGAG AG GCTCCTCAGA ACTGGAGCAT TTGTATAATA ATTCATCCTG 4260 
TTCATCTTCA ATTTTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 

35 CATATAAAAG GATATAAAAT TTGAAAAGAG AAACCTAATT GGCTATTTAA TCCAAAACAA 4380 
CI IU UU1 TCCTTCAATG GAATCAGAAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 
ATTACTTTTA AAATGGTGCA TTTGTGCTTC TGAACTATTT TGAAGAGTCA CTTCTGTTTA 4500 
CCTCAAGTAT CAATTCATCC TC CAT AC ATT TGAATTCAAG TTGTTTTTTG TCAAATTTAC 4560 
AGTTGTCAAT TGATCTTCAA GCTGCAGGGT GCCTAGAAAT GGGCCGTTGT CTGTAGCCCT 4620 

40 GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCAAAA GTCTGGAGAA GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTCCCTGATC 4740 
CACATTTGGC TGTTTACTAA AGCTTGTG AT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TG AAG TTTG G TTA AGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTAA 4920 

4 5 TCAAATTTAT GTTAATTGG A TCCTTTAAAT IT 1 1 1TIXSGC ATTTTCCAAC AAAAATGGCT 4980 

TTATTCATAA GAAAGGAAAA AAATCAATGG AATTTGATAT CTAAAGAAGT TAGAAAGGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAAAT TTACATTGTT AATTAAATCA 5160 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 
50 TTATCATGCT AGGATATTGA AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGGACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGGAATTC TGTTTAAGTT TGTAGGTACC TCTTGGACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAGATACAG ACAGTTCCAA GATTGACAAC AGAGAACAAC 5520 

5 5 CTGCTGG AAA GACCTGGGCA G AA ATGG AGA GCCCTGCGGG AACCATGCTA CATTTTCATC 5580 

TAAAGAGAGA ATGCACATCT GATGAG ACTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT 5640 
GGTATTG AAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTTTGTGTA ATCAAGTTTA 5700 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 

60 GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 

TAAACTCGGA GGGAACTGCA GGGAGACCAA CTTATTTAGA GCGAATTGGA CATGGATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTGAT TAGATTAATA ATTTAATAGA GGATGAGTGA 6000 
CCTCTGATAA ATTACTGCTA GAATGAACTT GTCAATGATG GATGGTAAAT TTTCATGGAA 6060 
GTTATAA AAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 

6 5 CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 6180 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TAAAAATTCA ATAAAGATTT 6240 
AGTGGTTAAG TGC 

Scq ID NO: 106 Protein sequence: 
70 Protein AccessioD ft NP 055883.1 

I 11 21 31 41 51 

I I I ! I I 

MTERRRDELS EEINNLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDIE 60 
7 5 LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFM AQC QIFMEKSTRD FSVDRVRVCF 120 

VTSMMTGRAA RWASAKLERS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 1 80 
VIDYSNAFQM IAQDLDWNEP AL1DQYHEGL SDHIQEELSH LEVAKSLSAL 1GQCIH1ERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300 
GGHYADNCPA KASKSSPAGN SPAPL 

Seq ID NO: 107 DNA sequence 

Nucleic Acid Accession U: NM_003679.l 

Coding sequence: 47- 1 5 07( underlined sequences con-cspond to start and stop codons) 

1 II 21 31 41 51 
I I 1 ] I I 

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGO ACTCATCTGT 60 
CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 120 
CTTTCTTGCA AAG AGGAATT TCCAGATTGA TGTATATGA A GCTAGGGAAG ATACTCGAGT 1 80 
GGCTACCTTC ACACGTGGAA G AAGCATTAA CTTAGCCCTT TCTCATAGAG G ACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG A AG ATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCAAG 300 
AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGG ATCTA TTGACTGCTG CTGAGAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 480 
OATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 
CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AG AACGGAGA 660 
TTATGCCATG GAACCTA ATT ATCTGCATAT TTGGCCTAG A AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780 " 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAG AGAAAC TCCTAGTGCA AGATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1 140 
AGAGATGCGA GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200 
TCTTCATGCG ATTATGCCAT CG ACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTCCAG 1 260 
AAT AACAT AC CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1500 
CAGGTGATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560 
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAG ATAGT GTTCTGCTTA 1 620 
TAATTAAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680 
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATAAAA GTGCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1800 
TCAGGGAAG A CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920 
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA GAGATCAGGT CTCACTCTGT TG ACCAGGCT GGAGTGCAGT 2220 
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 
TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT AAAGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCGACTA 2640 
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT CAAACATGAT 2760 
TAATTATGAA GATG AAACAC TAGAGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGGAAAGAAA A AATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT A AC A A AAG AC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 
TATATGTAAT ATATATGTGG G AAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGAAACAACA 3120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240 
TCTCTG AAAC TCCTCTTTGT CATACTGCAC ATATAAAACG TCTTTTGTTT CCAACAAGAG 3300 
GATTTCTTTT TCATTCTAG A ATTATCTCCT TGATAACTTG ATCAGATATA GGACATG ACA 3360 
CTGAATAG AG TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCG ACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAA AA TATATTTGAA ATG AAAATCT CCAACACATT AGAAGATG AT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 
ATT AAA AG AA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAG AGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TGAAAGTCCA A A AGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAA AGGAC GAACTTGG AT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTGACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAGAAGAGTT 3900 
GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT AAAGTGGAAT 4080 
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ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140 
TAAGTTGTTT TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAA ATC ACCCCAG AGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGG AAT 4260 
ATGTAGGACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
5 GAGGGCAGGT ACAACTATTA AG AGATTTTG AACATTAAGT TAGTCCACAA ATATTCAGTG 4380 
GGCATCTACT AGGTGACAGC CACTGTGCTA TAATTAGAGA Ci 1 1 IT ACT A TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AG AATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 

1 0 CATCGAATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 

GAACCTGGGA GGCCAACGCC TAAAGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT 4740 
AAATGACTGC ATAGGACAG A TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTG AAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATG AC CAGTAA AACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 

1 5 ATTATTTAGA GCG AAAT AAA TGTTGAATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 



20 



75 



Seq ID NO: 108'Protcin sequence: 
Protein Accession U: . NP_003670.1 



1 11 21 31 41 51 

MDSSV1QRKK VAVIGGG LVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 
RGRQALKAVG LEDQTVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 120 

25 AAEKYPNVKM HFNHRLLKCN PEEGMITVLG SDKVPKDVTC DUVGCDGAY STVRSHLMKK 180 
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMMIALPNMN KSFTCTLFMP 240 
FEEFEKLLTS NDVVDFFQKY FPDAIPLIGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300 
LLGDAAHAIV PFFGQGMNAG FEDCLVFDEL MDKFSNDL5L CLPVFSRUU PDDHA1SDLS 360 
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFIPLYTM VTFSRIRYHE AVQRWHWQKK 420 

30 VINKGLFFLG SLIAISSTYL LIHYMSPRSF LCLRRPWNWI AHFRNTTCFP AKAVDSLEQI 480 
SNLISR 

Seq ID NO: 109 DNA sequence 
Nucleic Acid Accession #: NM_0061 1 5. 1 

3 5 Coding sequence: 236- 1 765 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 
40 CGGG ACACCC CACCCGCTTC CCAGGCGTG A CCTGTCAACA GCAACTTCGC GGTGTGGTG A 1 20 
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180 
GAGACCTAG A A ATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240 
ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAG C TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CC CTGGC CAT 360 
45 TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420 
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 
TCTGGGAGTG CTGATGAAGG GACA ACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 
TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
5 0 TCTGTACTCA TTTCCAG AGC CAG AAGCAGC TCAGCCCATG ACAAAG AAGC GAAAAGTAG A 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATG AATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTG AAAA TGGTGCAGCT GGACTCTATT G AAGATTTGG AAGTG ACTTG 960 
5 5 TACCTGG AAG CTACCCACCT TGGCG AAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCfCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1 140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAA TAACTA ACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 
60 GTCCCAG AGT CCCAG CGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCGAG CCCC TCCAAGCTCT GCTGGAG AGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT G ATG AGTGTG GG ATCACGGA TG ATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 
CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 
65 TGTCCCCCTG G AG AGTTATG AGG ACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1 620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
GTGCCCC 1GT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 
TTGGACACT A A AGCCAGG AT G TGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1 860 
70 ACAAATGTTC AGTGTG AGTG AGG AA A ACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTG ACTTG AGGAGTTAAT 1980 
GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 
GATTCTGGCT TGGG AAGTAC ATGT AG G AGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTG A A AAT AAAGAGAAGC A ATG TG A AGC AAAAAAAAAA AAAAAAAA 



Seq ID NO: 1 10 Protein sequence: 
Protein Accession 6 : NP_006 1 06. 1 
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1 11 21 31 41 51 
I I I I I I 

MERRRLWGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL A1AALELLPR ELFPPLFMAA 60 
FDGRHSQTLK AMVQAWPFTC LPLGVLMKGQ HLHLETFKA V LDGLDVLLAQ EVRPRRWKLQ 120 
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEAEQ PFTPVEVLVD 180 
LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKLKIFAMPM QDIKM1LKMV QLDSIEDLEV 240 
TCTWKLPTLA KFSPYLGQM1 NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420 
SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 



Seq ID NO: 1 1 1 DNA sequence 

Nucleic Acid Accession #\ NM 0038 1 5 

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 

I II 21 31 41 51 
I I I I I 1 

CGCTGCCATG CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCGAGG GAGCCCTTGG AGCCCCAGGT CCTTCAGG AC GATCTCCCAA TTAGCCTCAA 1 80 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTG AGGATC AAGTTGGAGC TGGACGGTGA 240 
CAGTCATATC CTGG AGCTGC TACAGAATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAGAGGC TTGGTGGTCC TGACCCCAGA GAGAAGCTAT ACCCTGGAGC AGGGGCCTGG 480 
GGACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAGAG ACCAAGACTG TGGAGTTGGT 660 
GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGG AC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGGACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780 
ACTAGTGGGC CTGGAGGCCT GG ACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840 
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 
TGATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAG A CCTGCATCAT 1 140 
GGAGGCCTCC ACAGACTTCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1200 
GGAGAAAGCC CTCCTGG ATG G AATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGG 1 320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCTTTG ACCTGCCAGC TGAGGCCAGG t380 
TGCACAGTGT GCATCTGACG G ACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
GCAGTGTCGT CCTACCAGAG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1500 
CCAGTGTCCC CCTG ATG TCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGG 1620 
AGCCCAGCCC GCTGCGCC AC TTTGCCTCCA GACAGCTAAT ACTCGGGG AA ATGCnTTGG 1 680 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1740 
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TGAATGGGAC TGAGCTGAAC TGCAGCTGGG TGCACCTGGA 1860 
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAGACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CCACAGGGCT 2100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTG ATGCTT GGTGCCGGCT ACTGGTACCG 21 60 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGGAAG CCACTGCCTG CCGACCCCCA 2340 
GGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCG CTCTACCT CT GA CCTCTCCG 2460 
GAGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GGCGGGCGTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAG AC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 
GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 

Seq ID NO: 1 12 Protein sequence: 
Protein Accession #: NP_O03 806.2 

( II 21 31 41 51 
I I 1 I I I 

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LQTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRWSE GHTLENCCYQ 120 
GRVRGYAGSW VSICTCSGLR GLVVLTPERS YTLEQGPGDL QGPPIISRIQ DLHLPGHTCA 1 80 
LSWRESVHTQ TPPEHPLGQR HIRRRRDVVT ETKTVELVIV ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
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10 



60 



AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360 
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCQLRPGAQ CASDGPCCQN CQLRPSGWQC 480 
RPTRGDCDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGS1RDLLW 600 
ETIDVNGTEL NCSWVHLDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECRSK. 660 
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIPPLVVPS RPAPPPPTVS SLYL 

Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession 0: NMJ>024 1 6 

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons) 



15 1 II 21 31 41 51 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCAC TA TG AAGAAAAG TGGTGTTCTT 60 
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTG AGA 1 20 
AAGGGTCGCT GTTCCTGCAT CAGCACC A AC C A AGGGACTA TCCACCTACA ATCCTTG AAA 1 80 
20 GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 
AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 
AAGTGGG AG A AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 
AAG AAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAG AAGAC TACATA^GAG 420 
ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 
25 TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540 
ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTG ATTTT AAGAATCCAA ACGTTAAGAA 600 
TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 
TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 
CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 
30 TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 
CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GA TTGTT ATT GGGTATCACC 960 
ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 
TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 
3 5 AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1 140 
CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
40 AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 
CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 
CTAGCCTCTG GTAACCTCTT ACTTATTATC TrCAGGACAC TCACTACAGG GACCAGGGAT 1620 
G ATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1680 
45 GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGT TTACCG 1740 
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 
CCAACCATAC AAAAATTCCT TTTCCCGAAG G AAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 
TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 
AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT G ATTATCAAT TACCACACCA 1980 
5 0 TCTCCCATG A AG AAAGGG AA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 
CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATG ATTT AGAGTGCTGT CCGGTGGAGA 2220 
TCCCACCCGA ACGTCTTATC TAATCATG AA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 
5 5 AAAA ATCTAA GTGTTTCATA AATTTGAG AG TCTGTG ACCC ACTTACCTTG CATCTCACAG 2340 
GTAG ACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 
TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 
AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 



Seq ID NO: 1 14 Protein sequence: 
Protein Accession U: NP_002407 



1 11 21 31 41 51 
65 | | { ) | | 

MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDUC QFAPSPSCEK 60 
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKXQK. NGKKHQKKKV LKVRKSQRSR 120 . 
QKKTT 

70 Seq ID NO: 115 DNA sequence 

Nucleic Acid Accession U: NMJJ0323 8. 1 

Coding sequence: 182-1426 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CAAGCAGGAT ACGTTTTTCT GTTGGGCATT GACTAGATTG TTTGC AAAAG TTTCGCATCA 60 
AAAACAAACA ACAACAACAA AAAACCAAAC AACTCTCCTT GATCTATACT TTGAGAATTG 120 
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TTG ATTTCTT TTTTTTTATT CTGACTTTTA AAAACAACTT TTTTTTCCAC TTTTTTA A AA 1 80 
AATGCACTAC TQTGTGCTGA GCGCTTTTCT GATCCTGCAT CTGGTCACGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATG CGCAAGAGGA TCGAGGCGAT 300 
CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
CGAGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 
GAAGGCGAGC CGGAGGGCGG CCGCCTGCGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGG AGGTT TACA AAATAG ACATGCCGCC CTTCTTCCCC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AG ACCCTACT TCAGAATTGT TCG ATTTG AC GTCTCAGCAA TGG AG AAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660 
AGTGCCTGAA CAACGGATTG AGCTATATCA G ATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATT TGCAGGTATT GATGGCACCT CCACATATAC 960 
CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GA AGCGTGCT TTGGATGCGG CCTATTGCTT TAG AAATGTG CAGG ATAATT GCTGCCTACG 1 140 
TCCACTTTAC ATTG ATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 
CAGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1320 
GGTGTCCCAA GATTTAGAAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380 
TG AACAGCTT TCTAATATG A TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGG AAAAG 1 440 
TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
TGTAACAAGA AAACATAAGA GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1560 
GGTACTAGTT CAGACACTTT GGAAGTTTGT GTTCTGTTTG TTAAAACTGG CATCTGACAC 1620 
AAAAAAAGTT GAAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 

Seq ID NO: 116 Protein sequence: 
Protein Accession //: NPJJ03229. 1 

1 11 21 31 41 51 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSKLKL TSPPEDYPEP 60 
EEVPPEVISI YNSTRDLLQE KASRRAA ACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
TFYRPYFRJV RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRXELYQ ILKSKDLTSP 1 80 
TQRYIDSKW KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 
NKSEELEARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDNCCLR PLY1DFKRDL GWKWIHEPKG YNANFCAGAC PYLWSSDTQH 360 
SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKI EQLSNMIVKS CKCS 

Scq ID NO: 1 17 DNA sequence 

Nucleic Acid Accession #: NM_O00O95.1 

Coding sequence: 26-2299 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I 1 I I 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 
GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 
GOGGCAGCAG GTCAGGG AGA TCACGTTCCT GAAAA ACACG GTGATGGAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 
CGCCCACCCC TGCTTCCCCC G AGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCG A 480 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 
CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA G AGCGCGATG GCTCGCGGTC 780 
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 
AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTG ACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 
GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCO ACAACTGCCG 1080 
GTCCCAGAAG AACGACGACC AAAAG GACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGG GGAT GCCTGTGACA ACTGTCCCCA 1260 
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 
CGATCAAGAC CAGG ATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 
CGACAATGAC GG AGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTG CCTA ACCCCGGCCA 1500 
GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 
GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
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GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1 860 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
5 AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GGGCTACATC AGGGTGCG AT TCTATG AGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 
1 0 GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 
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ScqIDNO: 118 Protein sequence: 
Protein Accession #: NP_0000S6. 1 



1 11 21 31 41 51 

MVPDTACVLL LTLAALG ASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV AC1QTESGGR CGPCPAGFTG 120 

20 NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQG VGLAF AKANKQVCTD 180 
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 
ADCVLERDGS RSCVCRVGWA GMGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQIC 360 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

25 VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 
RDNCRLVPNP GQEDADRDG V GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540 
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV K.SSTGPGEQL RNALWHTGDT 660 
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN VVLDTTMRGG 720 

3 0 RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA 



Seq ID NO: 1 19 DNA sequence 

Nucleic Acid Accession NM_0I421 1 

Coding sequence: 1 57-1479 (underlined sequences correspond to start and slop codons) 



I II 21 31 41 51 

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGG ACCCAGA AGACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120 

40 CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 1 80 
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
GAGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 
AACAAATTTC TCAGGCCCAA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGG ACT ACACAGCCAC CATATACCTC 420 

45 CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480 
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600 
CTGTATGCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720 

5 0 G AGTTCACCT GGCTG AG AGG GAACG ACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780 
CAGTACACCA TAG AGCGGT A TTTCACCTTA GTCACCAG AT CGCAGCAGGA GACAGG AAAT 840 
TACACTAGAT TGGTCTTACA GTTTG AGCTT CGGAGG AATG TTCTGTATTT CATTTTGGAA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGACGACCG TGTTATCAAT GACCACACTG 1020 

55 ATG ATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCGATGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1 140 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA G ATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 

60 GACAAGTTCA AGTTTGTCTT CCGAG AAAAG ATGGGCAGGA TT GTTG ATT A TTT CACAATT 1 380 
CAAAACCCCA GTAATGTTG A TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTGAG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCAAAGA 1620 

65 ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAAGA 1740 
TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1 800 
AAGAATGGGA AGG AGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC 1860 
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 

70 GAGTAACATT TTCTAGTTTT TGTTTCTGGT TAA AATG AAA TATGGGCTTA TGTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA G ATGAGTTTT TAAATAATGA ATATTATTTA 2040 
ATACCACAAC AGAATTATCC CCAATTTCCA ATAAGTCCTA TCATTG AAAA TTCAAATATA 2100 
AGTGAAGAAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAGATACAAT 2160 
GGATTCCCCA TACTGGAAGG ACTCTGAGGC TTTATTCCCC C ACTATG CAT ATCTTATCAT 2220 

75 TTTATTATTA TACACACATC CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 
GGAAATGG AA GA M 1 11 1 f O TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 
AGGCTTGCAG AATTG AGTCC ATTTTCTAGC TGCCTTTATT CACATAGTGA TGGGGTACTA 2400 
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AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTG A TCCTTCCAGC T GGTC TGCTC 2640 
5 TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 
TTAATTTCTT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAG AAATAA 2940 

1 0 ACCAAGGCTC TAAAAG ATGA TTTCCCTTCT GTAACTCCCT AGAGCCACAG GTTCTCATTC 3000 
CTTTTCCCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTGATCACCT GATTTTTTTA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120 • 
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3 180 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 

1 5 AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 120 Protein sequence: 
Protein Accession #: NP 055026. 1 

20 I II 21 31 41 51 

MNYSLHLAPV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 
YlVESKKSFL HEVTVGNRLI RLFSNGTVLY ALR1TTTVAC NMDLSKYPMD TQTCKLQLES 1 80 
25 WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240 
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300 
CFDCAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNIINSSIS 360 
SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSK.L 420 
LFPLIFMLAN VFYWAYYMYF 



Seq ID NO: 121 DNA sequence 

Nucleic Acid Accession #: NM_001 854 

Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons) 



35 1 11 21 31 41 51 

AACCATCAAA TTTAGAAGAA AAAGCCCTTT GACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA GATGGAGCCG TGGTCCTCTA 180 

40 GGTGGAAAAC G AAACGGTGG CTCTGGG ATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAG AATT 360 
CTAAAGGCTC AG ATACTGCT TACAGAGTTT CAAAG CAAG C ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 

45 CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAG A TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600 
CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CG AAACCACT TG ATAGAAGT GAG AG AGCA A TTGTTG ATAC CAATGG AATC ACGGTTTTTG 780 

5 0 G AACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGG ACAT TCAGCAGTTT TTGATCACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 
TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTG AAAGT GTAAC AGAG G 1020 
GACCCACTGT AACTG AGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 

55 AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1 140 
GGACAAATG A GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAG AGGAAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGG AG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 

60 CAGAAACf G A TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 

65 CAACCATCTC TGCTCAGG AA GCTCAGG CTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 

70 GTCTGCCAGG TG ACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 

75 CAATTGGTCC TCCTGGTG AA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAO 2460 
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CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
5 GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 

1 0 ACCCTGGGCA ACGTGGGG AG ACTGGATTTC AAGGCAAGAC CGGCCCTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTG AGCAAG GTCTTCCTGG TGCTGCAGGA AAAG AAGGTG 3 1 80 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 

1 5 AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 

20 GCCCTCCCGG TCCCCCAGGT CTTCAAGG AC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 
ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 

25 CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCC AAAG GAG AAAGAGGAGA GAAAGGGG AA GCTGGTCCAC 4020 
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 
GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGG AGA TCCTGGTCAA CCGGGTCCTC 4200 

3 0 CTGGCCCATC TGGTG AGGCT GGCCCACCAG GTCCTCCTGG AAAACG AGGT CCTCCTGG AG 4260 

CTGCAGGTGC AGAGGGAAGA C AAGGTG AAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

3 5 GCTCCAAGGG TGA AAAGGG A CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAG AAC 4560 
AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 
GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGG ACC CGCTGGCCAG A AAGGTG AC A 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 

40 CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTG AAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG G ATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 
AACAAGACAT CG AGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 

45 CTTGC ATTTA TCCAG ACA AA AAATCTG AGG GAGTAAG AAT TTCATCATGG CCAAAGG AG A 5 1 60 
AACCAGGAAG TTGGTTTAGT G A ATTTA AG A GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGG AA 5340 
GTTATGACAA AGC ACTTCGC TTCCTGGGAT CAAATGATG A GGAGATGTCC TATGACAATA 5400 

50 ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCTTGGCT 5580 
AAG ATTAAG A CAA AG AACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 

5 5 TACCATTTAG G AAATACCGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AGCTTTG AAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 
TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 
AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAG GATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

60 CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGGAT GCCGAACTCT 6060 
GATGGCAATA ATATATGTAT TATG AAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATG ATGAA ATTCCTTTGT GTGTGTTT 

Seg ID NO: 122 Protein sequence: 
65 Protein Accession #: NPJD0I845 



1 11 21 31 41 51 

70 MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTG F 60 
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMTV 180 
DCKKXTTKPL DRSERAIVDT NGITVFGTR1 LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

75 NTVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 
AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 
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G ADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQA1L QQARJALRGP PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 
GLAGLPGADG PPGHPGKEGQ SGEK.GALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPK.GTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGWG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 
GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 
GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDK.GE IGEPGQKGSK 1 140 
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 
GELGPAGQDG VGGDK.GEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPG AAGQDGP PGPMGPPGLP 1440 
GUCGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQK.GDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEEIF GSLNSUCQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 
EYWIDPNQGC SGDSFKVYCN FTSGGETCrY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 
LSYLDVEGNS INMVQMTFUC LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTV1EI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 

Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession //: NMJH5886 

Coding sequence: 485-1261 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAG AC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTGAAAA TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 180 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTGA AGA AAGTCA A ATCAAGCAAT 360 
GCTCCCAAAT GATGTCTTTG TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTG AAAG CACAATTAG A TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAG AATG ACATG ATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGG ATGAAA ATCTTGCAAA 780 
ATCGGCAG AG GCTTGGGCGG CTACTTGCAT TTGGG ACCAT GGACCTTCTT ACTTACTGAG 840 
ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATGATGAAG TG AAAG ATT A TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCG A CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1 140 
AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGG A ACATGGGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGG AATTC 

Scq ID NO: 124 Protein sequence: 
Protein Accession #: NP_056970.1 

1 11 21 31 41 51 

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTDI EAALKAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 180 
SNR1GCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPGVT SNYLYWFK 



Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession U: NM J)0 1 793 

Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCTCCGAGCC GTGCCGGGCG GTC1TCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 
CGGAGCAGG A GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
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CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCG AG ACAGTCCAGG 300 
AAAGAAGGTC ACTGAAGG AA AGG AATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCA AGGG TC 420 
COTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAG AGACACC AAGATTTTCT 480 
ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAG AATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CG ACCAG A AT G ACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGG A 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 
ATCATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 
GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1 140 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACOCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTG ACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 
ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTGAA CATCACGG AC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
ACCATGGCAA CAAAG AGCAG CTG ACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1 980 
ATGTCGAAAC CTG CCCTG G A CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CG AGGTCTGG 2220 
AGGCCAGGCC GG AGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC COG ACACCCA 2280 
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAG ACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
GGAAGTGGCC GTAGCAACTT GGCGG AGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 
CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCA CTGGCC G 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GG ATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAG AA A 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP 00 1 784 

1 11 21 31 41 51 
I I I 1 1 I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEG VFAV EKETGWLLLN KPLDREEIAK 1 80 
YELFGHAVSE NGASVEDPMN ISI1VTDQND KKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN G VVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYUMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVE V TNEAPFVLKL 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTUD 540 
VNDHGPVPEP RQIT1CNQSP VRHVLN1TDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RK1KEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEVVL RNDVAPT1IP TPMYRPRPAN PDEIGNFHE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Scq ID NO: 127 DNA sequence 

Nucleic Acid Accession it: NM_G032S6. 1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I 1 1 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 

405 
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TGCCTGGGAO CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 120 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 180 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAG ATAAAGATGT 300 
5 TCAAAGGGTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT G ACTCTTCCC 360 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTG ACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA G AGGG AAAGT CTGAATCATC ACTACCATCT G AACTGTGGC TGCCAA ATCA 540 
CCACCTGCTA CAC AGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGG ACAGACT 600 

1 0 GGCTGTTGG A ACG A AAGCTC TATGGTTACC AGGCTCAGC A TTATGTCTGT ATGAAGCATG 660 
TTG ACGGCAC CTGCAGCTGG TACCGGGGCC ACCTGCCTCT CAGG AAGG AG TTTGTTGACA 720 
TCGTTCAGCC CTAGTAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 

15 TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTG TAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCGATC TGGTAGGGG A GAAATGGCG A 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1 140 
- GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 

20 

Seq ID NO: 128 Protein sequence: 
Protein Accession #: NP_003247. 1 

1 11 21 31 41 51 

MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALV1RA K1SSEKVVPA 60 
SADPADTEKM LRYEIKQIKM FKGFEKVKDV QYIYTPFDSS LCGVK LEANS QKQYLLTGQV 120 
LSDGKVFIHL CNYIEPWEDL SLVQRESLNH HYHLNCGCQI TTCYTVPCTI SAPNECLWTD 180 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

30 

Scq ID NO: 129 DNA sequence 

Nucleic Acid Accession #: NM_007207.2 

Coding sequence: 143-1 591 (underlined sequences correspond to start and stop codons) 
35 1 11 21 31 41 51 

CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGGAGCGCCA GATGGTGGAG GAATACACTT ATTTATGAAA CTGTCTTGAG TTCTTCTTGA 120 
ATTGCCAGTT TTCAGCCTCC TCATGCCTCC GTCTCCTTTA GACGACAGGG TAGTAGTGGC 180 

40 ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 

45 TGCTAACCAG ATGGTCAACA ATAATG AG AA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAG AAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG G AGCTGTCCA 720 
CATTAACTGT GCCG ATAAGA TCAGCCGGCG GAGACTGCAG CAGG GCAAG A TCACTGTCCT 780 

50 AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATG AGAATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTGA AGAGAG AAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 

55 CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCCTTGGCAA 1 140 
TGAGCAGGAT GCTCAGG ACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT ACAAGCGGCT 1260 
GCCAGCCACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAGAGG CTTTTGAGTT 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 

60 CCGCTCCGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGGTTGTG TG AC AATGGTCT GO ATGGA AAG GATTGCTG CT 1620 
CTCCATTAGG AGACAATGAG GAAGG AGGAT GGATTCTGGT TTTTTTTCTT TCTTTTTTTT 1680 

65 TTGTAGTTGG GAGTAAAGTT TGTGAATGG A AACAAACTTG GTTAAACACT TTATTTTTAA 1740 
CAAGTGTAAG AAG ACT AT AC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1800 
ATTAAGGAGG TTAAAG AAGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATACAACT 1 860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

70 Seq ID NO: 1 30 Protein sequence: 

Protein Accession #: NP J)09 138.1 



75 MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPV1ATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN Q AQTQ AlAAG TTTTAIGTST TCPANQMVNN 1 20 
NENTGSLSPS SGVGSPVSGT PKQLASDUI YPNDLAXKMT KCSKSHLPSQ GPVIIDCRPF 180 



21 



31 



51 



406 
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MEYNKSHIQG AVHINCADKI SRRRLQQGKJ TVLDUSCRE GKDSFKRIFS KEUVYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA G VSRSATTVI 420 
5 AYLMKHTRMT MTDAYKFVKG KRPHSPNLN FMGQLLEFEE DLNNGVTPRI LTPKLMGVET 480 
W 

Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: NM_O054O9.3 
1 0 Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

1 5 CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAG AAGTGA TTATTACCCT G AAAGAAAAT 300 
AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

20 GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTG ACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 
AGACTTTTCT ATGGTTTTGT G ACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 
GGGTGAAAGG ACCAAAAACA G AAATACAGT CTTCCTGAAT GAATGACAAT CAG AATTCCA 600 
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

25 GGTTACCATC GG AGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 
GCATTTCTAG GCTAG AG AAC CTTCTAG ATT TGATGCTTAC AACTATTCTG TTGTG ACTAT 780 
GAG AACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 
CTGTGGTTAC AGTGG AGACA TTG ACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

3 0 CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1 020 
TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 
TGGGATACTG GC A AC AGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 
GATGTTTTT C AACI H I ATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

3 5 TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTG AAAAA TAAGCAAATG TATACCTAGC 1320 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

40 Seq ID NO: 132 Protein sequence: 

Protein Accession if: NPJJ05400. 1 

1 II 21 31 41 51 

45 MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EV1ITLKENK GQRCLNPKSK. QARLIIKKVE RKNF 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession ft : NM_0 1 2342 

50 Coding sequence: 373-1 1 55 (underlined sequences correspond to start and stop codons) 

1 It 21 31 41 51 
I I I 1 I I 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT GTGGAGACCC TACTCTCTTC 60 

5 5 GCTGAGAACG GCCGCTAGCG GGGACTGAAG GCCGGGAGCC CACTCCCGAC CCGGGGCTAG 120 

CGTGCGTCCC TAGAGTCG AG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGGA 1 80 
AACTTTTCTG GGCTCCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACCCAGCCC CGCCGCTGAC GGAGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGGGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG 360 
60 TGCGGGGCGT CAATGGATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
GATCCTCAG A ACTCAAATTC CCCACTCACC CATGGCTGCC TGG ACTCTCT TGCAAGCACG 600 
ACAGACATCT GCCAAGCCAA ACAGGCCCGA AACCACTCTG GCACCACCAT ACCCACATTG 660 

6 5 GAATGCTGTC ATGAAGACAT GTGCAATTAC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 

AGGGGTGAGG CCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAGCAG AAACCTTATC 780 
ACCAAGGTGC AGGAGCTG AC TTCTTCCAAA G AGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 
GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 
CGAAGTGAAA ATAAG AGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 
70 AGCTTTCACG GACACCATTC CAAAAAGGGG CAGGTTGCAA AGTTAG ACTT GGAATGCATG 1 020 
GTGCCGGTCA GTGGGCACGA GAACTGCTGT CTGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 
CTCAGCAACG ATAAG ATCCT CTCGCTTGTT C ACTGGGGCA TGTACAGTGG GCACGGGAAG 1140 
CTGGAATTCG TATGACGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTGAGTTCTG CTGGACAGGA GCACTTTATC TGAAGACAAA CTCATTTAAT CATCTTTGAG 1260 

7 5 AG ACAAAATG ACCTCTGCAA AC AG AATCTT GGATATTTCT TCTG AAGGAT TATTTGCACA 1320 

GACTTAAATA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTGTACACA CTGTCACCAG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 
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TAAACAAAAA TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Scq ID NO: 134 Protein sequence: 
Protein Accession U: NP__036474. 1 

I 11 21 31 41 51 

I i i i i i 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIFTLECCH EDMCNYRGLH DVLSPPRGEA 120 
SGQGNRYQHD GSRNLITKVQ ELTSSK.ELWF RAAVIAVPIA GGLILVLLIM LALRMLRSEN 1 80 
KRLQDQRQQM LSRJLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
KILSLVHWGM YSGHGKLEFV 

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_00I 627. 1 

Coding sequence: 64-1 8 1 S (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGGATGGTAT ACTGTAAATT CAGCATATGG AGATACCATT 1 80 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 
AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACGACGATG TACCAGAATA CAAAGACAG A TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTGAAATT 480 
GTAAGCAAAG CACTGTTTCT CGAAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 
GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAG ACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AGAGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 
ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GTTCCCTGAT AGACAAAAAA 1020 
AGCATGATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAGACAG AT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1 140 
AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 
GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAAA 1740 
CATGTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAGA AAACAATCAC 1800 
AAAACTGAAG CCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC I860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATGATA 2040 
ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2100 
CTTTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 2 1 60 
TGTTCTGGG A GTTGCTGTTA ATCTG ATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTG AT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 
CCCTAATTAA AAACTGTGAT TTTTATCACA AGGGAGGGGA GGCCGAGAGT CAGACTGATA 2400 
GACACCATAG GAGCCGACTC TTTGATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATA ATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

Seq ID NO: 136 Protein sequence: 
Protein Accession #: NP_00161 8.1 

I 11 21 31 41 51 

MESKGASSCR LLFCLLISAT VFRPGLGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKKSVQY DDVPEYKDRL NLSENYTLSI SNARISDEKR FVCMLVTEDN 120 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLG DCISE DSYPDGNITW YRNGKVLHPL 1 80 
EGAWIIFKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTIHSEQAVF 240 
DIYYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 
MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
NATWWMKDN IRLRSSPSFS SLH YQDAGNY VCETALQEVE GUCKRESLTL IVEGKPQDCM 420 
TKKTDPSGLS KTIICHVEGF PKPAIQWTIT GSGSVINQTE ESPYINGRYY SKJHSPEEN 480 

408 
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VTLTCTAENQ LERTVNSLNV SA1SIPEHDE ADEISDENRE KVNDQAKUV GIWGLLLAA 540 
LVAGWYWLY MKKSKTASKH VNKDLGNMEE NKKLEENNHK TEA 



Scq ID NO: 137 DNA sequence 

Nucleic Acid Accession*: XMJJ30559 

Coding sequence: 1-1 19 (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

ATGAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC TGG AAAATTT 120 
G^G^ATC^ ™ 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATG ATAA TTATCACAAA 240 
GCTGmCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 
K5Sc£ TOCTA^AGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATrG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 
S^^ctATTATAGA CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480 

cta^ccacatggcVtagaaaaggttccagggatc^^ foo 
g^tctggctc aaagtacagg actattagctgttaatgatg "^^™™™? r ™ 

ATAGAAGTTTCAGGGAAGAG CCTTGATCAA GTAACAGACATGATGATTGC AAATA^GCCGT 720 
MO^TCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG G AACAGTCGG 780 
ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAGATT 840 

ga^Agctto^ 

AATGGAGTGC CACAGCAGAT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 
ACACAGATAG AGCTAAGCTT TG AGTCTGG A CAGAATGGCT TTATTCCCTC TAATG AAGTG 1020 

1080 

AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 

Scq ID NO: 138 Protein sequence: 
Protein Accession ft: XP_030559 

1 11 21 31 41 51 

MNRSHRHGAG SGCLGTMEVK SKFG AEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVDVLV 60 
f^^D^ pSnYHK A VSTANPLLR IFIQKKEEAD YSAFGTDTU KKI^LTNVL 120 
RPDNHRKKPH 1VISMPQDFR PVSSIIDVDI LPETHRRVRL YKYGTEKPLG FY1RDGSSVR 180 
S GLAQSTGLLA VNDEVLEVNG IEVSGKSLDQ VTD^IANSR 240 

NUITVRPAN QRNNWRNSR TSGSSGQSTD NSLLGYPQQI EPSFEPEDED SEEDD IIED 300 
NGVPQQIPKA VPNTESLESL TQIELSFESG QNGFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGTUTL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
5 publications, sequences of accession numbers, and patent applications cited in this 

specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 1 . A method of detecting a breast cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-25. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3. The method of claim 2, wherein the nucleic acids are mRNA. 

1 4. The method of claim 2, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 5. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-25. 

1 6 . The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 7. The method of claim 1, wherein the patient is undergoing a therapeutic 

2 regimen to treat breast cancer. 

1 8. The method of claim 1, wherein the patient is suspected of having 

2 breast cancer. 

1 9. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -25. 

1 10. The nucleic acid molecule of claim 9, which is labeled. 

1 1 1 . An expression vector comprising the nucleic acid of claim 9. 

1 12. A host cell comprising the expression vector of claim 1 1 . 
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1 13, An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-25. 

1 14. An antibody that specifically binds a polypeptide of claim 13. 

1 15. The antibody of claim 14, further conjugated to an effector component. 

1 1 6. The antibody of claim 15, wherein the effector component is a 

2 fluorescent label. 

1 17. The antibody of claim 15, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 18. The antibody of claim 15, which is an antibody fragment. 

1 19. The antibody of claim 15, which is a humanized antibody 

1 20. A method of detecting a breast cancer cell in a biological sample from 

2 a patient, the method comprising contacting the biological sample with an antibody of claim 

3 14. 

1 21. The method of claim 20, wherein the antibody is further conjugated to 

2 an effector component. 

1 22. The method of claim 2 1 , wherein the effector component is a 

2 fluorescent label. 

1 23. A method for identifying a compound that modulates a breast cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a breast cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-25; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 
1 24. A drug screening assay comprising the steps of 
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2 (i) administering a test compound to a mammal having breast cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of breast cancer. 
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